Field of view adjustment

ABSTRACT

Systems and methods are disclosed for field of view adjustment for image capture devices. For example, methods may include receiving a field of view selection; oversampling, using an image sensor, to obtain an image at a capture resolution that is greater than an encode resolution; determining a crop setting based on the field of view selection; cropping the image using the crop setting to obtain a cropped image; down-scaling the cropped image to obtain a scaled image at the encode resolution; encoding the scaled image at the encode resolution; and storing, displaying, or transmitting an output image based on the scaled image.

CROSS REFERENCE TO RELATED APPLICATION(S)

This application is a continuation of U.S. application Ser. No. 17/113,522, filed on Dec. 7, 2020, now U.S. Pat. No. 11,323,628, which is a continuation of U.S. application Ser. No. 16/106,607, filed on Aug. 21, 2018, now U.S. Pat. No. 10,863,097, the entire disclosures of which are incorporated by reference herein.

TECHNICAL FIELD

The present disclosure relates to field of view adjustment for image capture devices.

BACKGROUND

Image capture devices, such as cameras, may capture content as images (e.g., still images or frames of video). Light may be received and focused via a lens and may be converted to an electronic image signal by an image sensor. The image signal may be processed by an image signal processor (ISP) to form an image, which may be stored and/or encoded. In some implementations, multiple images or video frames may include spatially adjacent or overlapping content. Accordingly, systems, methods, and apparatus for capturing, processing, and/or encoding images, video, or both may be advantageous.

SUMMARY

The present disclosure describes, inter alia, apparatus and methods for field of view adjustment for image capture devices.

In a first aspect, the subject matter described in this specification can be embodied in systems that include an image sensor configured to capture images, a touchscreen display configured to present a user interface including a slider interface, and a processing apparatus that is configured to: receive a field of view selection via the slider interface; oversample, using the image sensor, to obtain an image at a capture resolution that is greater than an encode resolution; determine a crop setting for a RAW domain crop based on the field of view selection; crop the image in a RAW domain using the crop setting to obtain a cropped image; convert the cropped image from the RAW domain to a YUV domain; down-scale the cropped image in the YUV domain to obtain a scaled image at the encode resolution; encode the scaled image at the encode resolution to obtain an encoded image; and store or transmit the encoded image.

In a second aspect, the subject matter described in this specification can be embodied in methods that include receiving a field of view selection; oversampling, using an image sensor, to obtain an image at a capture resolution that is greater than an encode resolution; determining a crop setting based on the field of view selection; cropping the image using the crop setting to obtain a cropped image; down-scaling the cropped image to obtain a scaled image at the encode resolution; encoding the scaled image at the encode resolution; and storing, displaying, or transmitting an output image based on the scaled image.

In a third aspect, the subject matter described in this specification can be embodied in systems that include an image sensor configured to capture images and a processing apparatus configured to: receive a field of view selection; oversample, using the image sensor, to obtain an image at a capture resolution that is greater than an encode resolution; determine a crop setting based on the field of view selection; crop the image using the crop setting to obtain a cropped image; down-scale the cropped image to obtain a scaled image at the encode resolution; encode the scaled image at the encode resolution to obtain an encoded image; and store or transmit the encoded image.

These and other aspects of the present disclosure are disclosed in the following detailed description, the appended claims, and the accompanying figures.

BRIEF DESCRIPTION OF THE DRAWINGS

The disclosed embodiments have other advantages and features which will be more readily apparent from the detailed description, the appended claims, and the accompanying figures. A brief introduction of the figures is below.

FIG. 1 is a diagram of an example of an image capture system for content capture.

FIG. 2 is a block diagram of an example of an image capture device.

FIG. 3 is a cross-sectional view of an example of an image capture apparatus including overlapping fields-of-view.

FIG. 4 is a block diagram of an example of an image processing and coding pipeline.

FIG. 5 is a functional block diagram of an example of an image signal processor.

FIG. 6A is a block diagram of an example of a system configured for image capture.

FIG. 6B is a block diagram of an example of a system configured for image capture.

FIG. 7 is a block diagram of an example of an image processing pipeline with an adjustable field of view for capturing images.

FIG. 8 is a block diagram of an example of an image processing pipeline with an adjustable field of view for capturing images with a limited YUV domain down-scaling ratio.

FIG. 9 is a flowchart of an example of a process for adjusting a field of view for an image capture device.

FIG. 10 is a flowchart of an example of a process for determining crop settings based on a field of view selection.

All figures disclosed herein are © Copyright 2022 GoPro Inc. All rights reserved.

DETAILED DESCRIPTION

Content, such as visual content, may be captured as one or more images (e.g., still images or video frames) by one or more image capture devices, such as a camera or camera array. An image capture device may include one or more lenses, image sensors, image signal processors, encoders, or combinations thereof. A lens may receive and focus light on an image sensor or sensors. An image sensor or sensors may sample the light and generate an electronic image signal. An image signal processor (ISP) may receive the image signal from one or more sensors and may process the image signal to generate an image, picture, or frame. The generated images may be stored, such as in a memory of an image capture device, and/or sent to an encoder for encoding, which may include compression. When an image is captured by a sensor device, noise is present, and it is desirable to suppress this noise.

A smooth field of view adjustment feature is described herein that may allow the user to smoothly change the field of view during capture and still keep the encoded resolution and frame rate constant. This feature may be implemented by cropping an oversampled image (e.g., a 4K resolution image), which represents about a 118.2 degree horizontal field of view (HFOV), to a desired field of view, and then scaling the cropped image to a target encode/record resolution. The process of scaling the cropped field of view image to the target encode resolution may be critical to image quality in a camera or other image capture device. If the process of scaling the cropped resolution to the target resolution involves up-scaling, then the resulting image may look soft and pixelated. Smooth field of view adjustment implementations may use the oversampled resolution from the sensor and avoid up-scaling in a digital image processing pipeline, which may preserve better image quality compared to the digital zoom feature available in standard digital cameras/phones. For example, a widest field of view for an image capture device may be ˜118 degrees HFOV, a medium field of view may be ˜85 degrees HFOV, and a narrow field of view may be ˜62 degrees HFOV. For example, this approach may be applied to 1080p video at 120/60/30/24 frames per second and 1440p video at 60/30/24 frames per second modes.
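
To make the crop-and-scale arithmetic concrete, the following Python sketch estimates a centered crop window for a requested horizontal field of view and checks that scaling it to the encode resolution does not involve up-scaling. The idealized rectilinear (tangent) mapping, the helper names, and the example resolutions are illustrative assumptions, not the calibrated lens model of any particular device.

```python
import math

def crop_width_for_hfov(capture_width, capture_hfov_deg, target_hfov_deg):
    # Estimate the crop width (pixels) that spans target_hfov_deg, assuming an
    # idealized rectilinear projection; a real device would use its calibrated lens model.
    full = math.tan(math.radians(capture_hfov_deg) / 2.0)
    part = math.tan(math.radians(target_hfov_deg) / 2.0)
    return int(round(capture_width * part / full))

def plan_fov_adjustment(capture_res, encode_res, capture_hfov_deg, target_hfov_deg):
    # Return (crop_width, crop_height, scale_ratio) for a centered crop that is then
    # down-scaled to the encode resolution.
    cap_w, cap_h = capture_res
    enc_w, enc_h = encode_res
    crop_w = crop_width_for_hfov(cap_w, capture_hfov_deg, target_hfov_deg)
    crop_h = int(round(crop_w * enc_h / enc_w))  # keep the encode aspect ratio
    if crop_w > cap_w or crop_h > cap_h:
        raise ValueError("requested field of view is wider than the captured image")
    if crop_w < enc_w or crop_h < enc_h:
        # Scaling a crop smaller than the encode resolution would be up-scaling,
        # which makes the output look soft and pixelated.
        raise ValueError("requested field of view would require up-scaling")
    return crop_w, crop_h, enc_w / crop_w

# Example: 4K capture (~118.2 degrees HFOV) recorded at 1080p with a ~85 degree view.
print(plan_fov_adjustment((3840, 2160), (1920, 1080), 118.2, 85.0))
```

With these example numbers, a roughly 85 degree view cropped from a 4K capture is still larger than 1920x1080, so the final step is a down-scale rather than an up-scale.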

In some implementations, a smooth field of view adjustment (e.g., zoom) feature may be implemented while recording by dynamically programming a RAW crop, a YUV crop, and/or a YUV scaler by tuning the crop/scale ratio to avoid visible glitches to an end user.
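
One way to keep the adjustment free of visible glitches is to move the crop toward the new setting by a small amount on every frame rather than jumping. The sketch below is a minimal illustration of that idea; the per-frame step limit is a hypothetical value, not a tuned parameter from an actual device.

```python
def smooth_crop_updates(current_width, target_width, max_step_px=16):
    # Yield a per-frame sequence of crop widths that approaches target_width in
    # small steps so the change in field of view is not visible as a jump.
    width = current_width
    while width != target_width:
        step = max(-max_step_px, min(max_step_px, target_width - width))
        width += step
        yield width

# Example: widen the view over successive frames from a 2109-pixel crop to 2600 pixels.
frames = list(smooth_crop_updates(2109, 2600, max_step_px=64))
print(frames[:5], "...", frames[-1])
```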

The image quality improvement may be achieved in tandem with a low power preview mode for an image capture device. For example, an image capture device may run a low power preview mode most of the time. In the low power preview mode, a RAW scaler is used to down-scale so aggressively that, for some field of view settings, the images are up-scaled and hence image quality is reduced. Use of these techniques for smooth field of view adjustment may be selectively initiated when a user is actively engaging with a device for image capture. For example, a low power preview mode may be run when a user is not using the camera, and when the user touches the LCD screen, this event may provide an indication that the user is going to use the camera for framing. For example, in this case an image processing pipeline may be dynamically switched to a configuration described herein that avoids substantial RAW scaling and achieves smooth field of view adjustment/zoom using YUV crop and down-scaling only. In some implementations, gyroscope data from an image capture device may provide an indication when a user picks up the camera, instead of an LCD touch.
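
A minimal sketch of the mode-selection logic described above is shown below, assuming hypothetical event inputs (an LCD touch flag and a gyroscope motion magnitude); the names and threshold are placeholders rather than an actual device API.

```python
from dataclasses import dataclass

@dataclass
class PipelineConfig:
    raw_scaling: bool         # heavy RAW down-scaling used by the low power preview
    yuv_crop_and_scale: bool  # YUV-domain crop and down-scale used for smooth FOV adjustment

LOW_POWER_PREVIEW = PipelineConfig(raw_scaling=True, yuv_crop_and_scale=False)
SMOOTH_FOV_MODE = PipelineConfig(raw_scaling=False, yuv_crop_and_scale=True)

def select_pipeline(lcd_touched: bool, gyro_motion_magnitude: float,
                    motion_threshold: float = 0.5) -> PipelineConfig:
    # Stay in low power preview until an LCD touch or significant motion suggests
    # the user is framing a shot, then switch to the smooth FOV configuration.
    if lcd_touched or gyro_motion_magnitude > motion_threshold:
        return SMOOTH_FOV_MODE
    return LOW_POWER_PREVIEW

# Example: the camera is picked up (gyro reports motion) but the screen is untouched.
print(select_pipeline(lcd_touched=False, gyro_motion_magnitude=1.2))
```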

Implementations are described in detail with reference to the drawings, which are provided as examples so as to enable those skilled in the art to practice the technology. The figures and examples are not meant to limit the scope of the present disclosure to a single implementation or embodiment, and other implementations and embodiments are possible by way of interchange of, or combination with, some or all of the described or illustrated elements. Wherever convenient, the same reference numbers will be used throughout the drawings to refer to the same or like parts.

FIG. 1 is a diagram of an example of an image capture system 100 for content capture in accordance with implementations of this disclosure. As shown in FIG. 1, an image capture system 100 may include an image capture apparatus 110, an external user interface (UI) device 120, or a combination thereof.

In some implementations, the image capture apparatus 110 may be a multi-face apparatus and may include multiple image capture devices, such as image capture devices 130, 132, 134 as shown in FIG. 1, arranged in a structure 140, such as a cube-shaped cage as shown. Although three image capture devices 130, 132, 134 are shown for simplicity in FIG. 1, the image capture apparatus 110 may include any number of image capture devices. For example, the image capture apparatus 110 shown in FIG. 1 may include six cameras, which may include the three image capture devices 130, 132, 134 shown and three cameras not shown.

In some implementations, the structure 140 may have dimensions, such as between 25 mm and 150 mm. For example, the length of each side of the structure 140 may be 105 mm. The structure 140 may include a mounting port 142, which may be removably attachable to a supporting structure, such as a tripod, a photo stick, or any other camera mount (not shown). The structure 140 may be a rigid support structure, such that the relative orientation of the image capture devices 130, 132, 134 of the image capture apparatus 110 may be maintained in relatively static or fixed alignment, except as described herein.

The image capture apparatus 110 may obtain, or capture, image content, such as images, video, or both, with a 360° field-of-view, which may be referred to herein as panoramic or spherical content. For example, each of the image capture devices 130, 132, 134 may include respective lenses, for receiving and focusing light, and respective image sensors for converting the received and focused light to an image signal, such as by measuring or sampling the light, and the multiple image capture devices 130, 132, 134 may be arranged such that respective image sensors and lenses capture a combined field-of-view characterized by a spherical or near spherical field-of-view.

In some implementations, each of the image capture devices 130, 132, 134 may have a respective field-of-view 170, 172, 174, such as a field-of-view 170, 172, 174 that includes 90° in a latitudinal dimension 180, 182, 184 and includes 120° in a longitudinal dimension 190, 192, 194. In some implementations, image capture devices 130, 132, 134 having overlapping fields-of-view 170, 172, 174, or the image sensors thereof, may be oriented at defined angles, such as at 90°, with respect to one another. In some implementations, the image sensor of the image capture device 130 is directed along the X axis, the image sensor of the image capture device 132 is directed along the Y axis, and the image sensor of the image capture device 134 is directed along the Z axis. The respective fields-of-view 170, 172, 174 for adjacent image capture devices 130, 132, 134 may be oriented to allow overlap for a stitching function. For example, the longitudinal dimension 190 of the field-of-view 170 for the image capture device 130 may be oriented at 90° with respect to the latitudinal dimension 184 of the field-of-view 174 for the image capture device 134, the latitudinal dimension 180 of the field-of-view 170 for the image capture device 130 may be oriented at 90° with respect to the longitudinal dimension 192 of the field-of-view 172 for the image capture device 132, and the latitudinal dimension 182 of the field-of-view 172 for the image capture device 132 may be oriented at 90° with respect to the longitudinal dimension 194 of the field-of-view 174 for the image capture device 134.

The image capture apparatus 110 shown in FIG. 1 may have 420° angular coverage in vertical and/or horizontal planes by the successive overlap of 90°, 120°, 90°, 120° respective fields-of-view 170, 172, 174 (not all shown) for four adjacent image capture devices 130, 132, 134 (not all shown). For example, fields-of-view 170, 172 for the image capture devices 130, 132 and fields-of-view (not shown) for two image capture devices (not shown) opposite the image capture devices 130, 132, respectively, may be combined to provide 420° angular coverage in a horizontal plane. In some implementations, the overlap between fields-of-view of image capture devices 130, 132, 134 having a combined field-of-view including less than 360° angular coverage in a vertical and/or horizontal plane may be aligned and merged or combined to produce a panoramic image. For example, the image capture apparatus 110 may be in motion, such as rotating, and source images captured by at least one of the image capture devices 130, 132, 134 may be combined to form a panoramic image. As another example, the image capture apparatus 110 may be stationary, and source images captured contemporaneously by each image capture device 130, 132, 134 may be combined to form a panoramic image.

In some implementations, an image capture device 130, 132, 134 may include a lens 150, 152, 154 or other optical element. An optical element may include one or more lens, macro lens, zoom lens, special-purpose lens, telephoto lens, prime lens, achromatic lens, apochromatic lens, process lens, wide-angle lens, ultra-wide-angle lens, fisheye lens, infrared lens, ultraviolet lens, perspective control lens, other lens, and/or other optical element. In some implementations, a lens 150, 152, 154 may be a fisheye lens and produce fisheye, or near-fisheye, field-of-view images. For example, the respective lenses 150, 152, 154 of the image capture devices 130, 132, 134 may be fisheye lenses. In some implementations, images captured by two or more image capture devices 130, 132, 134 of the image capture apparatus 110 may be combined by stitching or merging fisheye projections of the captured images to produce an equirectangular planar image. For example, a first fisheye image may be a round or elliptical image, and may be transformed to a first rectangular image, a second fisheye image may be a round or elliptical image, and may be transformed to a second rectangular image, and the first and second rectangular images may be arranged side-by-side, which may include overlapping, and stitched together to form the equirectangular planar image.

Although not expressly shown in FIG. 1, in some implementations, an image capture device 130, 132, 134 may include one or more image sensors, such as a charge-coupled device (CCD) sensor, an active pixel sensor (APS), a complementary metal-oxide semiconductor (CMOS) sensor, an N-type metal-oxide-semiconductor (NMOS) sensor, and/or any other image sensor or combination of image sensors.

Although not expressly shown in FIG. 1, in some implementations, an image capture apparatus 110 may include one or more microphones, which may receive, capture, and record audio information, which may be associated with images acquired by the image sensors.

Although not expressly shown in FIG. 1, the image capture apparatus 110 may include one or more other information sources or sensors, such as an inertial measurement unit (IMU), a global positioning system (GPS) receiver component, a pressure sensor, a temperature sensor, a heart rate sensor, or any other unit, or combination of units, that may be included in an image capture apparatus.

In some implementations, the image capture apparatus 110 may interface with or communicate with an external device, such as the external user interface (UI) device 120, via a wired (not shown) or wireless (as shown) computing communication link 160. Although a single computing communication link 160 is shown in FIG. 1 for simplicity, any number of computing communication links may be used. Although the computing communication link 160 shown in FIG. 1 is shown as a direct computing communication link, an indirect computing communication link, such as a link including another device or a network, such as the internet, may be used. In some implementations, the computing communication link 160 may be a Wi-Fi link, an infrared link, a Bluetooth (BT) link, a cellular link, a ZigBee link, a near field communications (NFC) link, such as an ISO/IEC 23243 protocol link, an Advanced Network Technology interoperability (ANT+) link, and/or any other wireless communications link or combination of links. In some implementations, the computing communication link 160 may be an HDMI link, a USB link, a digital video interface link, a display port interface link, such as a Video Electronics Standards Association (VESA) digital display interface link, an Ethernet link, a Thunderbolt link, and/or other wired computing communication link.

In some implementations, the user interface device 120 may be a computing device, such as a smartphone, a tablet computer, a phablet, a smart watch, a portable computer, and/or another device or combination of devices configured to receive user input, communicate information with the image capture apparatus 110 via the computing communication link 160, or receive user input and communicate information with the image capture apparatus 110 via the computing communication link 160.

In some implementations, the image capture apparatus 110 may transmit images, such as panoramic images, or portions thereof, to the user interface device 120 via the computing communication link 160, and the user interface device 120 may store, process, display, or a combination thereof the panoramic images.

In some implementations, the user interface device 120 may display, or otherwise present, content, such as images or video, acquired by the image capture apparatus 110. For example, a display of the user interface device 120 may be a viewport into the three-dimensional space represented by the panoramic images or video captured or created by the image capture apparatus 110.

In some implementations, the user interface device 120 may communicate information, such as metadata, to the image capture apparatus 110. For example, the user interface device 120 may send orientation information of the user interface device 120 with respect to a defined coordinate system to the image capture apparatus 110, such that the image capture apparatus 110 may determine an orientation of the user interface device 120 relative to the image capture apparatus 110. Based on the determined orientation, the image capture apparatus 110 may identify a portion of the panoramic images or video captured by the image capture apparatus 110 for the image capture apparatus 110 to send to the user interface device 120 for presentation as the viewport. In some implementations, based on the determined orientation, the image capture apparatus 110 may determine the location of the user interface device 120 and/or the dimensions for viewing of a portion of the panoramic images or video.
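
As an illustration of selecting a portion of panoramic content from a display orientation, the sketch below maps a yaw and pitch reading to a pixel window in an equirectangular panorama. The linear angle-to-pixel mapping and the viewport size are simplifying assumptions, not the apparatus's actual viewport logic.

```python
def viewport_from_orientation(pano_width, pano_height, yaw_deg, pitch_deg,
                              view_width, view_height):
    # Return (left, top, right, bottom) of a viewport window in an equirectangular
    # panorama, centered on the given yaw and pitch; the horizontal edges wrap.
    cx = (yaw_deg % 360.0) / 360.0 * pano_width            # yaw spans 0..360 degrees
    cy = (90.0 - pitch_deg) / 180.0 * pano_height          # pitch spans +90..-90 degrees
    left = int(cx - view_width / 2) % pano_width
    top = max(0, min(pano_height - view_height, int(cy - view_height / 2)))
    return left, top, (left + view_width) % pano_width, top + view_height

# Example: a 4000x2000 panorama viewed while the display points 30 degrees left and slightly up.
print(viewport_from_orientation(4000, 2000, yaw_deg=-30.0, pitch_deg=10.0,
                                view_width=1280, view_height=720))
```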

In an example, a user may rotate (sweep) the user interface device 120 through an arc or path 122 in space, as indicated by the arrow shown at 122 in FIG. 1. The user interface device 120 may communicate display orientation information to the image capture apparatus 110 using a communication interface such as the computing communication link 160. The image capture apparatus 110 may provide an encoded bitstream to enable viewing of a portion of the panoramic content corresponding to a portion of the environment of the display location as the image capture apparatus 110 traverses the path 122. Accordingly, display orientation information from the user interface device 120 may be transmitted to the image capture apparatus 110 to control user selectable viewing of captured images and/or video.

In some implementations, the image capture apparatus 110 may communicate with one or more other external devices (not shown) via wired or wireless computing communication links (not shown).

In some implementations, data, such as image data, audio data, and/or other data, obtained by the image capture apparatus 110 may be incorporated into a combined multimedia stream. For example, the multimedia stream may include a video track and/or an audio track. As another example, information from various metadata sensors and/or sources within and/or coupled to the image capture apparatus 110 may be processed to produce a metadata track associated with the video and/or audio track. The metadata track may include metadata, such as white balance metadata, image sensor gain metadata, sensor temperature metadata, exposure time metadata, lens aperture metadata, bracketing configuration metadata, and/or other parameters. In some implementations, a multiplexed stream may be generated to incorporate a video and/or audio track and one or more metadata tracks.

In some implementations, the user interface device 120 may implement or execute one or more applications, such as GoPro Studio, GoPro App, or both, to manage or control the image capture apparatus 110. For example, the user interface device 120 may include an application for controlling camera configuration, video acquisition, video display, or any other configurable or controllable aspect of the image capture apparatus 110.

In some implementations, the user interface device 120, such as via an application (e.g., GoPro App), may generate and share, such as via a cloud-based or social media service, one or more images, or short video clips, such as in response to user input.

In some implementations, the user interface device 120, such as via an application (e.g., GoPro App), may remotely control the image capture apparatus 110, such as in response to user input.

In some implementations, the user interface device 120, such as via an application (e.g., GoPro App), may display unprocessed or minimally processed images or video captured by the image capture apparatus 110 contemporaneously with capturing the images or video by the image capture apparatus 110, such as for shot framing, which may be referred to herein as a live preview, and which may be performed in response to user input.

In some implementations, the user interface device 120, such as via an application (e.g., GoPro App), may mark one or more key moments contemporaneously with capturing the images or video by the image capture apparatus 110, such as with a HiLight Tag, such as in response to user input.

In some implementations, the user interface device 120, such as via an application (e.g., GoPro App), may display, or otherwise present, marks or tags associated with images or video, such as HiLight Tags, such as in response to user input. For example, marks may be presented in a GoPro Camera Roll application for location review and/or playback of video highlights.

In some implementations, the user interface device 120, such as via an application (e.g., GoPro App), may wirelessly control camera software, hardware, or both. For example, the user interface device 120 may include a web-based graphical interface accessible by a user for selecting a live or previously recorded video stream from the image capture apparatus 110 for display on the user interface device 120.

In some implementations, the user interface device 120 may receive information indicating a user setting, such as an image resolution setting (e.g., 3840 pixels by 2160 pixels), a frame rate setting (e.g., 60 frames per second (fps)), a location setting, and/or a context setting, which may indicate an activity, such as mountain biking, in response to user input, and may communicate the settings, or related information, to the image capture apparatus 110.
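
For illustration only, such user settings could be bundled into a simple message before being sent over the computing communication link; the field names below are hypothetical and do not reflect an actual protocol between the devices.

```python
import json

def build_capture_settings(resolution=(3840, 2160), frame_rate_fps=60,
                           location=None, context="mountain biking"):
    # Package user-selected capture settings (resolution, frame rate, optional
    # location, and activity context) as a JSON message for the image capture apparatus.
    return json.dumps({
        "resolution": {"width": resolution[0], "height": resolution[1]},
        "frame_rate_fps": frame_rate_fps,
        "location": location,
        "context": context,
    })

print(build_capture_settings())
```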

FIG. 2 is a block diagram of an example of an image capture device 200 in accordance with implementations of this disclosure. In some implementations, an image capture device 200, such as one of the image capture devices 130, 132, 134 shown in FIG. 1, which may be an action camera, may include an audio component 210, a user interface (UI) unit 212, an input/output (I/O) unit 214, a sensor controller 220, a processor 222, an electronic storage unit 224, an image sensor 230, a metadata unit 232, an optics unit 234, a communication unit 240, a power system 250, or a combination thereof.

In some implementations, the audio component 210, which may include a microphone, may receive, sample, capture, record, or a combination thereof audio information, such as sound waves, which may be associated with, such as stored in association with, image or video content contemporaneously captured by the image capture device 200. In some implementations, audio information may be encoded using, e.g., Advanced Audio Coding (AAC), Audio Compression—3 (AC3), Moving Picture Experts Group Layer-3 Audio (MP3), linear Pulse Code Modulation (PCM), Motion Picture Experts Group—High efficiency coding and media delivery in heterogeneous environments (MPEG-H), and/or other audio coding formats (audio codecs). In one or more implementations of spherical video and/or audio, the audio codec may include a three-dimensional audio codec, such as Ambisonics. For example, an Ambisonics codec can produce full surround audio including a height dimension. Using a G-format Ambisonics codec, a special decoder may be omitted.

In some implementations, the user interface unit 212 may include one or more units that may register or receive input from and/or present outputs to a user, such as a display, a touch interface, a proximity sensitive interface, a light receiving/emitting unit, a sound receiving/emitting unit, a wired/wireless unit, and/or other units. In some implementations, the user interface unit 212 may include a display, one or more tactile elements (e.g., buttons and/or virtual touch screen buttons), lights (LEDs), speakers, and/or other user interface elements. The user interface unit 212 may receive user input and/or provide information to a user related to the operation of the image capture device 200.

In some implementations, the user interface unit 212 may include a display unit that presents information related to camera control or use, such as operation mode information (e.g., image resolution, frame rate, capture mode, sensor mode, video mode, photo mode), connection status information (e.g., connected, wireless, wired connection), power mode information (e.g., standby mode, sensor mode, video mode), information related to other information sources (e.g., heart rate, GPS), and/or other information.

In some implementations, the user interface unit 212 may include a user interface component such as one or more buttons, which may be operated, such as by a user, to control camera operations, such as to start, stop, pause, and/or resume sensor and/or content capture. The camera control associated with respective user interface operations may be defined. For example, the camera control associated with respective user interface operations may be defined based on the duration of a button press (pulse width modulation), a number of button presses (pulse code modulation), or a combination thereof. In an example, a sensor acquisition mode may be initiated in response to detecting two short button presses. In another example, the initiation of a video mode and cessation of a photo mode, or the initiation of a photo mode and cessation of a video mode, may be triggered (toggled) in response to a single short button press. In another example, video or photo capture for a given time duration or a number of frames (burst capture) may be triggered in response to a single short button press. Other user command or communication implementations may also be implemented, such as one or more short or long button presses.

In some implementations, the I/O unit 214 may synchronize the image capture device 200 with other cameras and/or with other external devices, such as a remote control, a second image capture device, a smartphone, a user interface device, such as the user interface device 120 shown in FIG. 1, and/or a video server. The I/O unit 214 may communicate information between I/O components. In some implementations, the I/O unit 214 may be connected to the communication unit 240 to provide a wired and/or wireless communications interface (e.g., Wi-Fi, Bluetooth, USB, HDMI, Wireless USB, Near Field Communication (NFC), Ethernet, a radio frequency transceiver, and/or other interfaces) for communication with one or more external devices, such as a user interface device, such as the user interface device 120 shown in FIG. 1, or another metadata source. In some implementations, the I/O unit 214 may interface with LED lights, a display, a button, a microphone, speakers, and/or other I/O components. In some implementations, the I/O unit 214 may interface with an energy source, e.g., a battery, and/or a Direct Current (DC) electrical source.

In some implementations, the I/O unit 214 of the image capture device 200 may include one or more connections to external computerized devices for configuration and/or management of remote devices, as described herein. The I/O unit 214 may include any of the wireless or wireline interfaces described herein, and/or may include customized or proprietary connections for specific applications.

In some implementations, the sensor controller 220 may operate or control the image sensor 230, such as in response to input, such as user input. In some implementations, the sensor controller 220 may receive image and/or video input from the image sensor 230 and may receive audio information from the audio component 210.

In some implementations, the processor 222 may include a system on a chip (SOC), microcontroller, microprocessor, CPU, DSP, application-specific integrated circuit (ASIC), GPU, and/or other processor that may control the operation and functionality of the image capture device 200. In some implementations, the processor 222 may interface with the sensor controller 220 to obtain and process sensory information for, e.g., object detection, face tracking, stereo vision, and/or other image processing.

In some implementations, the sensor controller 220, the processor 222, or both may synchronize information received by the image capture device 200. For example, timing information may be associated with received sensor data, and metadata information may be related to content (photo/video) captured by the image sensor 230 based on the timing information. In some implementations, the metadata capture may be decoupled from video/image capture. For example, metadata may be stored before, after, and in-between the capture, processing, or storage of one or more video clips and/or images.

In some implementations, the sensor controller 220, the processor 222, or both may evaluate or process received metadata and may generate other metadata information. For example, the sensor controller 220 may integrate the received acceleration information to determine a velocity profile for the image capture device 200 concurrent with recording a video. In some implementations, video information may include multiple frames of pixels and may be encoded using an encoding method (e.g., H.265, H.264, CineForm, and/or other codec).
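
A minimal sketch of the acceleration-integration step mentioned above, accumulating accelerometer samples into a velocity profile; the fixed sample interval and the omission of gravity and drift correction are simplifying assumptions for illustration.

```python
def velocity_profile(accel_samples, sample_interval_s):
    # Integrate per-axis acceleration samples (m/s^2) into a velocity profile (m/s)
    # with a simple cumulative sum; a real device would also correct for gravity and drift.
    velocities = []
    vx = vy = vz = 0.0
    for ax, ay, az in accel_samples:
        vx += ax * sample_interval_s
        vy += ay * sample_interval_s
        vz += az * sample_interval_s
        velocities.append((vx, vy, vz))
    return velocities

# Example: three accelerometer readings taken 10 ms apart.
print(velocity_profile([(0.1, 0.0, 9.8), (0.2, 0.0, 9.8), (0.0, 0.1, 9.8)], 0.01))
```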

Although not shown separately in FIG. 2, one or more of the audio component 210, the user interface unit 212, the I/O unit 214, the sensor controller 220, the processor 222, the electronic storage unit 224, the image sensor 230, the metadata unit 232, the optics unit 234, the communication unit 240, or the power systems 250 of the image capture device 200 may communicate information, power, or both with one or more other units, such as via an electronic communication pathway, such as a system bus. For example, the processor 222 may interface with the audio component 210, the user interface unit 212, the I/O unit 214, the sensor controller 220, the electronic storage unit 224, the image sensor 230, the metadata unit 232, the optics unit 234, the communication unit 240, or the power systems 250 via one or more driver interfaces and/or software abstraction layers. In some implementations, one or more of the units shown in FIG. 2 may include a dedicated processing unit, memory unit, or both (not shown). In some implementations, one or more components may be operable by one or more other control processes. For example, a GPS receiver may include a processing apparatus that may provide position and/or motion information to the processor 222 in accordance with a defined schedule (e.g., values of latitude, longitude, and elevation at 10 Hz).

In some implementations, the electronic storage unit 224 may include a system memory module that may store executable computer instructions that, when executed by the processor 222, perform various functionalities including those described herein. For example, the electronic storage unit 224 may be a non-transitory computer-readable storage medium, which may include executable instructions, and a processor, such as the processor 222, may execute the instructions to perform one or more, or portions of one or more, of the operations described herein. The electronic storage unit 224 may include storage memory for storing content (e.g., metadata, images, audio) captured by the image capture device 200.

In some implementations, the electronic storage unit 224 may include non-transitory memory for storing configuration information and/or processing code for video information and metadata capture, and/or to produce a multimedia stream that may include video information and metadata in accordance with the present disclosure. In some implementations, the configuration information may include capture type (video, still images), image resolution, frame rate, burst setting, white balance, recording configuration (e.g., loop mode), audio track configuration, and/or other parameters that may be associated with audio, video, and/or metadata capture. In some implementations, the electronic storage unit 224 may include memory that may be used by other hardware/firmware/software elements of the image capture device 200.

In some implementations, the image sensor 230 may include one or more of a charge-coupled device sensor, an active pixel sensor, a complementary metal-oxide semiconductor sensor, an N-type metal-oxide-semiconductor sensor, and/or another image sensor or combination of image sensors. In some implementations, the image sensor 230 may be controlled based on control signals from a sensor controller 220.

The image sensor 230 may sense or sample light waves gathered by the optics unit 234 and may produce image data or signals. The image sensor 230 may generate an output signal conveying visual information regarding the objects or other content corresponding to the light waves received by the optics unit 234. The visual information may include one or more of an image, a video, and/or other visual information.

In some implementations, the image sensor 230 may include a video sensor, an acoustic sensor, a capacitive sensor, a radio sensor, a vibrational sensor, an ultrasonic sensor, an infrared sensor, a radar sensor, a Light Detection And Ranging (LIDAR) sensor, a sonar sensor, or any other sensory unit or combination of sensory units capable of detecting or determining information in a computing environment.

In some implementations, the metadata unit 232 may include sensors such as an IMU, which may include one or more accelerometers and/or gyroscopes, a magnetometer, a compass, a GPS sensor, an altimeter, an ambient light sensor, a temperature sensor, and/or other sensors or combinations of sensors. In some implementations, the image capture device 200 may contain one or more other metadata/telemetry sources, e.g., image sensor parameters, battery monitor, storage parameters, and/or other information related to camera operation and/or capture of content. The metadata unit 232 may obtain information related to the environment of the image capture device 200 and aspects in which the content is captured.

For example, the metadata unit 232 may include an accelerometer that may provide device motion information including velocity and/or acceleration vectors representative of motion of the image capture device 200. In another example, the metadata unit 232 may include a gyroscope that may provide orientation information describing the orientation of the image capture device 200. In another example, the metadata unit 232 may include a GPS sensor that may provide GPS coordinates, time, and information identifying a location of the image capture device 200. In another example, the metadata unit 232 may include an altimeter that may obtain information indicating an altitude of the image capture device 200.

In some implementations, the metadata unit 232, or one or more portions thereof, may be rigidly coupled to the image capture device 200 such that motion, changes in orientation, or changes in the location of the image capture device 200 may be accurately detected by the metadata unit 232. Although shown as a single unit, the metadata unit 232, or one or more portions thereof, may be implemented as multiple distinct units. For example, the metadata unit 232 may include a temperature sensor as a first physical unit and a GPS unit as a second physical unit. In some implementations, the metadata unit 232, or one or more portions thereof, may be included in an image capture device 200 as shown, or may be included in a physically separate unit operatively coupled to, such as in communication with, the image capture device 200.

In some implementations, the optics unit 234 may include one or more of a lens, macro lens, zoom lens, special-purpose lens, telephoto lens, prime lens, achromatic lens, apochromatic lens, process lens, wide-angle lens, ultra-wide-angle lens, fisheye lens, infrared lens, ultraviolet lens, perspective control lens, other lens, and/or other optics component. In some implementations, the optics unit 234 may include a focus controller unit that may control the operation and configuration of the camera lens. The optics unit 234 may receive light from an object and may focus received light onto an image sensor 230. Although not shown separately in FIG. 2, in some implementations, the optics unit 234 and the image sensor 230 may be combined, such as in a combined physical unit, such as a housing.

In some implementations, the communication unit 240 may be coupled to the I/O unit 214 and may include a component (e.g., a dongle) having an infrared sensor, a radio frequency transceiver and antenna, an ultrasonic transducer, and/or other communications interfaces used to send and receive wireless communication signals. In some implementations, the communication unit 240 may include a local (e.g., Bluetooth, Wi-Fi) and/or broad range (e.g., cellular LTE) communications interface for communication between the image capture device 200 and a remote device (e.g., the user interface device 120 in FIG. 1). The communication unit 240 may communicate using, for example, Ethernet, 802.11, worldwide interoperability for microwave access (WiMAX), 3G, Long Term Evolution (LTE), digital subscriber line (DSL), asynchronous transfer mode (ATM), InfiniBand, PCI Express Advanced Switching, and/or other communication technologies. In some implementations, the communication unit 240 may communicate using networking protocols, such as multiprotocol label switching (MPLS), transmission control protocol/Internet protocol (TCP/IP), User Datagram Protocol (UDP), hypertext transport protocol (HTTP), simple mail transfer protocol (SMTP), file transfer protocol (FTP), and/or other networking protocols.

Information exchanged via the communication unit 240 may be represented using formats including one or more of hypertext markup language (HTML), extensible markup language (XML), and/or other formats. One or more exchanges of information between the image capture device 200 and remote or external devices may be encrypted using encryption technologies including one or more of secure sockets layer (SSL), transport layer security (TLS), virtual private networks (VPNs), Internet Protocol security (IPsec), and/or other encryption technologies.

In some implementations, the one or more power systems 250 supply power to the image capture device 200. For example, for a small-sized, lower-power action camera, a wireless power solution (e.g., battery, solar cell, inductive (contactless) power source, rectification, and/or other power supply) may be used.

Consistent with the present disclosure, the components of the image capture device 200 may be remote from one another and/or aggregated. For example, one or more sensor components may be distal from the image capture device 200, e.g., such as shown and described with respect to FIG. 1. Multiple mechanical, sensory, or electrical units may be controlled by a learning apparatus via network/radio connectivity.

FIG. 3 is a cross-sectional view of an example of a dual-lens image capture apparatus 300 including overlapping fields-of-view 310, 312 in accordance with implementations of this disclosure. In some implementations, the image capture apparatus 300 may be a spherical image capture apparatus with fields-of-view 310, 312 as shown in FIG. 3. For example, the image capture apparatus 300 may include image capture devices 320, 322, related components, or a combination thereof, arranged in a back-to-back or Janus configuration. For example, a first image capture device 320 may include a first lens 330 and a first image sensor 340, and a second image capture device 322 may include a second lens 332 and a second image sensor 342 arranged oppositely from the first lens 330 and the first image sensor 340.

The first lens 330 of the image capture apparatus 300 may have the field-of-view 310 shown above a boundary 350. Behind the first lens 330, the first image sensor 340 may capture a first hyper-hemispherical image plane from light entering the first lens 330, corresponding to the first field-of-view 310.

The second lens 332 of the image capture apparatus 300 may have a field-of-view 312 as shown below a boundary 352. Behind the second lens 332, the second image sensor 342 may capture a second hyper-hemispherical image plane from light entering the second lens 332, corresponding to the second field-of-view 312.

In some implementations, one or more areas, such as blind spots 360, 362, may be outside of the fields-of-view 310, 312 of the lenses 330, 332, light may be obscured from the lenses 330, 332 and the respective image sensors 340, 342, and content in the blind spots 360, 362 may be omitted from capture. In some implementations, the image capture apparatus 300 may be configured to minimize the blind spots 360, 362.

In some implementations, the fields-of-view 310, 312 may overlap. Stitch points 370, 372, proximal to the image capture apparatus 300, at which the fields-of-view 310, 312 overlap may be referred to herein as overlap points or stitch points. Content captured by the respective lenses 330, 332, distal to the stitch points 370, 372, may overlap.

In some implementations, images contemporaneously captured by the respective image sensors 340, 342 may be combined to form a combined image. Combining the respective images may include correlating the overlapping regions captured by the respective image sensors 340, 342, aligning the captured fields-of-view 310, 312, and stitching the images together to form a cohesive combined image.

In some implementations, a small change in the alignment (e.g., position and/or tilt) of the lenses 330, 332, the image sensors 340, 342, or both may change the relative positions of their respective fields-of-view 310, 312 and the locations of the stitch points 370, 372. A change in alignment may affect the size of the blind spots 360, 362, which may include changing the size of the blind spots 360, 362 unequally.

In some implementations, incomplete or inaccurate information indicating the alignment of the image capture devices 320, 322, such as the locations of the stitch points 370, 372, may decrease the accuracy, efficiency, or both of generating a combined image. In some implementations, the image capture apparatus 300 may maintain information indicating the location and orientation of the lenses 330, 332 and the image sensors 340, 342 such that the fields-of-view 310, 312, stitch points 370, 372, or both may be accurately determined, which may improve the accuracy, efficiency, or both of generating a combined image.

In some implementations, optical axes through the lenses 330, 332 may be substantially antiparallel to each other, such that the respective axes may be within a tolerance such as 1%, 3%, 5%, 10%, and/or other tolerances. In some implementations, the image sensors 340, 342 may be substantially perpendicular to the optical axes through their respective lenses 330, 332, such that the image sensors may be perpendicular to the respective axes to within a tolerance such as 1%, 3%, 5%, 10%, and/or other tolerances.
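
To make the tolerance check concrete, the sketch below measures how far two optical axes are from being exactly antiparallel. Expressing the deviation as a percentage of a 180 degree turn is an assumption about how such a tolerance might be defined, made only for illustration.

```python
import math

def antiparallel_deviation_percent(axis_a, axis_b):
    # Return the deviation of two 3D axis vectors from exact antiparallelism,
    # expressed as a percentage of a 180 degree turn.
    dot = sum(a * b for a, b in zip(axis_a, axis_b))
    norm = math.sqrt(sum(a * a for a in axis_a)) * math.sqrt(sum(b * b for b in axis_b))
    angle_deg = math.degrees(math.acos(max(-1.0, min(1.0, dot / norm))))
    return abs(180.0 - angle_deg) / 180.0 * 100.0

# Example: a second lens axis tilted about 2 degrees away from exactly opposite the first.
deviation = antiparallel_deviation_percent(
    (0.0, 0.0, 1.0),
    (math.sin(math.radians(2)), 0.0, -math.cos(math.radians(2))))
print(f"{deviation:.2f}% (within a 3% tolerance: {deviation <= 3.0})")
```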

In some implementations, the lenses 330, 332 may be laterally offset from each other, may be off-center from a central axis of the image capture apparatus 300, or may be laterally offset and off-center from the central axis. As compared to an image capture apparatus with back-to-back lenses (e.g., lenses aligned along the same axis), the image capture apparatus 300 including laterally offset lenses 330, 332 may include substantially reduced thickness relative to the lengths of the lens barrels securing the lenses 330, 332. For example, the overall thickness of the image capture apparatus 300 may be close to the length of a single lens barrel as opposed to twice the length of a single lens barrel as in a back-to-back configuration. Reducing the lateral distance between the lenses 330, 332 may improve the overlap in the fields-of-view 310, 312.

In some implementations, images or frames captured by an image capture apparatus, such as the image capture apparatus 110 shown in FIG. 1 or the image capture apparatus 300 shown in FIG. 3, may be combined, merged, or stitched together, to produce a combined image, such as a spherical or panoramic image, which may be an equirectangular planar image. In some implementations, generating a combined image may include three-dimensional, or spatiotemporal, noise reduction (3DNR). In some implementations, pixels along the stitching boundary may be matched accurately to minimize boundary discontinuities.

FIG. 4 is a block diagram of an example of an image processing and coding pipeline 400 in accordance with implementations of this disclosure. In some implementations, the image processing and coding pipeline 400 may be included in an image capture device, such as the image capture device 200 shown in FIG. 2, or an image capture apparatus, such as the image capture apparatus 110 shown in FIG. 1 or the image capture apparatus 300 shown in FIG. 3. In some implementations, the image processing and coding pipeline 400 may include an image signal processor (ISP) 410, an encoder 420, or a combination thereof.

The image signal processor 410 may receive an input image signal 430. For example, an image sensor (not shown), such as image sensor 230 shown in FIG. 2, may capture an image, or a portion thereof, and may send, or transmit, the captured image, or image portion, to the image signal processor 410 as the input image signal 430. In some implementations, an image, or frame, such as an image, or frame, included in the input image signal, may be one of a sequence or series of images or frames of a video, such as a sequence, or series, of frames captured at a rate, or frame rate, which may be a number or cardinality of frames captured per defined temporal period, such as 24, 30, or 60 frames per second.

The image signal processor 410 may include a local motion estimation (LME) unit 412, which may generate local motion estimation information for use in image signal processing and encoding, such as in correcting distortion, stitching, and/or motion compensation. In some implementations, the local motion estimation unit 412 may partition the input image signal 430 into blocks (e.g., having 4×4, 16×16, 64×64, and/or other dimensions). In some implementations, the local motion estimation unit 412 may partition the input image signal 430 into arbitrarily shaped patches and/or individual pixels.

The local motion estimation unit 412 may compare pixel values of blocks of pixels between image frames, such as successive image frames, from the input image signal 430 to determine displacement, or movement, between frames. The local motion estimation unit 412 may produce motion vectors (e.g., an x component and y component of motion) at multiple locations within an image frame. The motion vectors may be represented by a translational model or other models that may approximate camera motion, such as rotation and translation in three dimensions, and zooming.
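
The block comparison described above can be illustrated with a brute-force sum-of-absolute-differences (SAD) search, a generic form of block matching; this is a sketch of the technique, not necessarily the estimator used by the local motion estimation unit 412.

```python
import numpy as np

def best_motion_vector(reference, target, block_top, block_left, block_size=16, search=8):
    # Find the (dy, dx) displacement in `reference` that best matches the block of
    # `target` at (block_top, block_left), using sum of absolute differences (SAD).
    tgt = target[block_top:block_top + block_size, block_left:block_left + block_size]
    best, best_sad = (0, 0), float("inf")
    for dy in range(-search, search + 1):
        for dx in range(-search, search + 1):
            y, x = block_top + dy, block_left + dx
            if y < 0 or x < 0 or y + block_size > reference.shape[0] or x + block_size > reference.shape[1]:
                continue  # candidate block falls outside the reference frame
            ref = reference[y:y + block_size, x:x + block_size]
            sad = np.abs(ref.astype(np.int32) - tgt.astype(np.int32)).sum()
            if sad < best_sad:
                best_sad, best = sad, (dy, dx)
    return best

# Example: a frame shifted 3 pixels to the right should yield a displacement of (0, -3).
rng = np.random.default_rng(0)
reference = rng.integers(0, 256, size=(64, 64), dtype=np.uint8)
target = np.roll(reference, 3, axis=1)  # content moved 3 pixels to the right
print(best_motion_vector(reference, target, 24, 24))
```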

The image signal processor 410 of the image processing and coding pipeline 400 may include electronic storage 414, such as memory (e.g., random access memory (RAM), flash, or other types of memory). The electronic storage 414 may store local motion estimation information 416 determined by the local motion estimation unit 412 for one or more frames. The local motion estimation information 416 and associated image or images may be passed as output 440 to the encoder 420. In some implementations, the electronic storage 414 may include a buffer, or cache, and may buffer the input image signal as an input, or source, image, or frame.

The image signal processor 410 may output an image, associated local motion estimation information 416, or both as the output 440. For example, the image signal processor 410 may receive the input image signal 430, process the input image signal 430, and output a processed image as the output 440. Processing the input image signal 430 may include generating and using the local motion estimation information 416, spatiotemporal noise reduction (3DNR), dynamic range enhancement, local tone adjustment, exposure adjustment, contrast adjustment, image stitching, and/or other operations.

The encoder 420 may encode or compress the output 440 of the image signal processor 410. In some implementations, the encoder 420 may implement one or more encoding standards, which may include motion estimation.

The encoder 420 may output encoded video as an encoded output 450. For example, the encoder 420 may receive the output 440 of the image signal processor 410, which may include processed images, the local motion estimation information 416, or both. The encoder 420 may encode the images and may output the encoded images as the encoded output 450.

The encoder 420 may include a motion estimation unit 422 that may determine motion information for encoding the image of output 440 of the image signal processor 410. In some implementations, the encoder 420 may encode the image of output 440 of the image signal processor 410 using motion information generated by the motion estimation unit 422 of the encoder 420, the local motion estimation information 416 generated by the local motion estimation unit 412 of the image signal processor 410, or a combination thereof. For example, the motion estimation unit 422 may determine motion information at pixel block sizes that may differ from pixel block sizes used by the local motion estimation unit 412. In another example, the motion estimation unit 422 of the encoder 420 may generate motion information and the encoder may encode the image of output 440 of the image signal processor 410 using the motion information generated by the motion estimation unit 422 of the encoder 420 and the local motion estimation information 416 generated by the local motion estimation unit 412 of the image signal processor 410. In another example, the motion estimation unit 422 of the encoder 420 may use the local motion estimation information 416 generated by the local motion estimation unit 412 of the image signal processor 410 as input for efficiently and accurately generating motion information.

The image signal processor 410, the encoder 420, or both may be distinct units, as shown. For example, the image signal processor 410 may include a motion estimation unit, such as the local motion estimation unit 412 as shown, and/or the encoder 420 may include a motion estimation unit, such as the motion estimation unit 422.

The image signal processor 410 may store motion information, such as the local motion estimation information 416, in a memory, such as the electronic storage 414, and the encoder 420 may read the motion information from the electronic storage 414 or otherwise receive the motion information from the image signal processor 410. The encoder 420 may use the motion estimation information determined by the image signal processor 410 for motion compensation processing.

FIG. 5 is a functional block diagram of an example of an image signal processor 500 in accordance with implementations of this disclosure. An image signal processor 500 may be included in an image capture device, such as the image capture device 200 shown in FIG. 2, or an image capture apparatus, such as the image capture apparatus 110 shown in FIG. 1 or the image capture apparatus 300 shown in FIG. 3. In some implementations, the image signal processor 500 may be similar to the image signal processor 410 shown in FIG. 4.

The image signal processor 500 may receive an image signal, such as from an image sensor (not shown), such as the image sensor 230 shown in FIG. 2, in a defined format, such as a format of the image sensor, which may be referred to herein as “RAW,” such as “a RAW image,” “RAW image data,” “RAW data,” “a RAW signal,” or “a RAW image signal.” For example, the RAW image signal may be in a format such as RGB format, which may represent individual pixels using a combination of values or components, such as a red component (R), a green component (G), and a blue component (B). In some implementations, the image signal processor 500 may convert the RAW image data (RGB data) to another format, such as a format expressing individual pixels using a combination of values or components, such as a luminance, or luma, value (Y), a blue chrominance, or chroma, value (U or Cb), and a red chroma value (V or Cr), such as the YUV or YCbCr formats.
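
As an illustration of the RGB-to-YUV step, the sketch below applies the widely used BT.601 full-range conversion for a single pixel; the coefficients and ranges used by a particular image signal processor may differ.

```python
def rgb_to_yuv(r, g, b):
    # Convert one full-range RGB pixel (0-255) to Y, Cb (U), Cr (V) using the common
    # BT.601 coefficients; results would normally be clipped to 0-255 afterwards.
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = 128.0 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128.0 + 0.5 * r - 0.418688 * g - 0.081312 * b
    return y, cb, cr

# Example: a pure red pixel maps to roughly (76.2, 85.0, 255.5).
print(rgb_to_yuv(255, 0, 0))
```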

The image signal processor 500 may include a front image signalprocessor (Front ISP) 510, or multiple front image signal processors asshown, a local motion estimation (LME) unit 520, a local motioncompensation (LMC) unit 522, a global motion compensation (GMC) unit524, a high dynamic range (HDR) unit 530, a three-dimensional noisereduction (3DNR) unit 540, which may include a temporal noise reduction(TNR) unit 542 and a RAW to RAW (R2R) unit 544, a RAW to YUV (R2Y) unit550, a YUV to YUV (Y2Y) unit 560, a warp and blend unit 570, a stitchingcost unit 580, a scaler 585, an image signal processor bus (ISP BUS)590, a configuration controller 595, or a combination thereof.

Although not shown expressly in FIG. 5, in some implementations, one ormore of the front image signal processor 510, the local motionestimation unit 520, the local motion compensation unit 522, the globalmotion compensation unit 524, the high dynamic range unit 530, thethree-dimensional noise reduction unit 540, the temporal noise reductionunit 542, the RAW to RAW unit 544, the RAW to YUV unit 550, the YUV toYUV unit 560, the warp and blend unit 570, the stitching cost unit 580,the scaler 585, the image signal processor bus 590, the configurationcontroller 595, or any combination thereof, may include a respectiveclock, power domain, or both.

In some implementations, the front image signal processor 510 may minimally process image signals received from respective image sensors, which may include image scaling. Scaling, by the front image signal processor 510, may include processing pixels, such as a defined cardinality of pixels, corresponding to a determined quality. For example, the front image signal processor 510 may correct dead pixels, perform band processing, decouple vertical blanking, or a combination thereof. In some implementations, the front image signal processor 510 may output a high resolution frame, one or more downscaled, or reduced, resolution frames, such as a ½×½ resolution frame, a ¼×¼ resolution frame, a ⅛×⅛ resolution frame, a 1/16×1/16 resolution frame, a 1/32×1/32 resolution frame, or any combination thereof.

In some implementations, a multiple camera apparatus, such as the image capture apparatus 110 shown in FIG. 1, may include multiple image capture devices, such as the image capture device 200 shown in FIG. 2, and may include a respective front image signal processor 510 associated with each image capture device.

The local motion estimation unit 520 may receive a target image (e.g., a target frame of video) and a reference image (e.g., a reference frame of video) and determine motion information (e.g., a set of motion vectors) that may be used to determine a transformation that may be applied to the reference image to better align image portions (e.g., pixels or blocks of pixels) of the reference image with corresponding image portions of the target image.

The local motion estimation unit 520 may receive, or otherwise access, a target image, or one or more portions thereof, which may be a current input frame, such as via the image signal processor bus 590. In some implementations, the local motion estimation unit 520 may receive the target image at a downscaled, or reduced, resolution. In some implementations, such as implementations implementing high dynamic range image processing, the target image may be a long exposure input frame.

The local motion estimation unit 520 may receive, or otherwise access, a reference image, or one or more portions thereof, such as via the image signal processor bus 590. In some implementations, such as implementations including temporal noise reduction, the reference image may be a recirculated frame that has been generated based on one or more previous frames of video from an image sensor. For example, the reference image may be a recirculated frame from the three-dimensional noise reduction unit 540. In some implementations, such as implementations including high dynamic range image processing, the reference image may be a short exposure input frame corresponding to the long exposure input frame that will be combined with the long exposure input frame to obtain a high dynamic range frame.

In some implementations, the local motion estimation unit 520 may receive, or otherwise access, previously generated motion information, such as previously generated motion vectors for the target image or motion information for a previously processed frame.

The local motion estimation unit 520 may determine motion information, such as motion vectors, representing motion between the reference image and the target image, such as motion caused by moving objects in the field-of-view or non-rotational motion, or translation, of the field-of-view. The local motion estimation unit 520 may output the motion information. For example, the local motion estimation unit 520 may output motion vectors to the local motion compensation unit 522.
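
For illustration only, the following Python sketch shows one common way motion vectors of this kind can be computed, using brute-force block matching between a target frame and a reference frame. It is not the disclosed implementation of the local motion estimation unit 520; the function name, block size, and search range are illustrative assumptions.

    import numpy as np

    def estimate_block_motion(target, reference, block=16, search=8):
        """Brute-force block matching: for each block of the target frame,
        find the offset into the reference frame that minimizes the sum of
        absolute differences (SAD). Returns an array of (dy, dx) vectors."""
        h, w = target.shape
        vectors = np.zeros((h // block, w // block, 2), dtype=np.int32)
        for by in range(0, h - block + 1, block):
            for bx in range(0, w - block + 1, block):
                tgt = target[by:by + block, bx:bx + block].astype(np.int32)
                best, best_dyx = None, (0, 0)
                for dy in range(-search, search + 1):
                    for dx in range(-search, search + 1):
                        y, x = by + dy, bx + dx
                        if y < 0 or x < 0 or y + block > h or x + block > w:
                            continue
                        ref = reference[y:y + block, x:x + block].astype(np.int32)
                        sad = np.abs(tgt - ref).sum()
                        if best is None or sad < best:
                            best, best_dyx = sad, (dy, dx)
                vectors[by // block, bx // block] = best_dyx
        return vectors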

The local motion compensation unit 522 may receive, or otherwise access, a reference image, or one or more portions thereof, such as via the image signal processor bus 590. In some implementations, such as implementations implementing temporal noise reduction processing, the reference image may be a recirculated frame (e.g., from the three-dimensional noise reduction unit 540). In some implementations, such as implementations implementing high dynamic range image processing, the reference image may be the short exposure input frame, where a corresponding long exposure frame has been used as the target image. In some implementations, such as implementations implementing high dynamic range image processing, the reference image may be a long exposure input frame, where a corresponding short exposure frame has been used as the target image.

The local motion compensation unit 522 may receive, or otherwise access, motion information, such as motion vectors, associated with the reference image. For example, the local motion compensation unit 522 may receive the motion vectors from the local motion estimation unit 520.

The local motion compensation unit 522 may generate or obtain a prediction image (e.g., a prediction frame), or a portion thereof, such as a prediction block, which may be a prediction of the target image, or a portion thereof, such as a target block of the target image, based on the reference image, or a portion thereof, and the local motion information. For example, a prediction image may be obtained by applying a transformation, which is based on the local motion information, to the reference image (e.g., a recirculated frame or a short exposure frame). The local motion compensation unit 522 may output a local motion prediction image, or one or more portions thereof, which may be referred to herein as a local motion compensated image (e.g., a local motion compensated frame of video).

The global motion compensation unit 524 may receive, or otherwise access, the reference image, or one or more portions thereof, such as via the image signal processor bus 590. In some implementations, such as implementations implementing temporal noise reduction processing, the reference image may be a recirculated frame (e.g., from the three-dimensional noise reduction unit 540). In some implementations, such as implementations implementing high dynamic range image processing, the reference image may be a short exposure input frame, where a corresponding long exposure input frame has been used as the target image. In some implementations, such as implementations implementing high dynamic range image processing, the reference image may be a long exposure input frame, where a corresponding short exposure input frame has been used as the target image.

The global motion compensation unit 524 may receive, or otherwise access, global motion information, such as global motion information from a gyroscopic unit of the image capture apparatus, such as a gyroscopic sensor included in the metadata unit 232 shown in FIG. 2, corresponding to a time period between capture of the reference image and capture of the target image. The global motion information may indicate a non-translational change in the orientation of the field-of-view relative to the content captured in respective images. For example, the global motion information may indicate a horizontal change of the field-of-view, which may indicate that the corresponding camera panned, or rotated, around a vertical axis. In another example, the global motion information may indicate a vertical change of the field-of-view, which may indicate that the camera tilted or rotated around an axis perpendicular to the lens. In another example, the global motion information may indicate a rotational change of the field-of-view relative to the horizon, which may indicate that the camera rolled or rotated around an axis parallel to the lens. The global motion information may be distinct from motion information, such as translation motion information, indicating a change in the geospatial location of the image capture apparatus, which may include a change associated with changing an elevation of the image capture apparatus.

The global motion compensation unit 524 may generate or obtain a prediction image (e.g., a prediction frame of video), or a portion thereof, such as a prediction block, which may be a prediction of the target image, or a portion thereof, such as a target block of the target image, based on the reference image, or a portion thereof, and the global motion information. For example, a prediction image may be obtained by applying a transformation, which is based on the global motion information, to the reference image (e.g., a recirculated frame or a short exposure frame). The global motion compensation unit 524 may output a global motion prediction image, or one or more portions thereof, which may be referred to herein as a global motion compensated image (e.g., a global motion compensated frame of video).
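
Purely as an illustration of how gyroscope-derived global motion might be converted into a compensating transformation, the following Python sketch integrates angular rate samples over the inter-frame interval into pan, tilt, and roll angles and maps pan and tilt to an approximate pixel shift. It assumes a small-angle pinhole model, ignores rolling shutter and lens distortion, and uses illustrative names throughout; it is not the disclosed behavior of the global motion compensation unit 524.

    import numpy as np

    def global_shift_from_gyro(gyro_rates, timestamps, focal_length_px):
        """Integrate gyroscope angular rates (rad/s, shape (N, 3) as
        [pan, tilt, roll]) over the interval between the reference and
        target frames, then convert pan and tilt to a pixel translation
        under a pinhole-camera approximation. Roll is returned as an angle."""
        dt = np.diff(timestamps)                       # seconds between samples
        pan, tilt, roll = (gyro_rates[:-1] * dt[:, None]).sum(axis=0)
        dx = focal_length_px * np.tan(pan)             # horizontal shift, pixels
        dy = focal_length_px * np.tan(tilt)            # vertical shift, pixels
        return dx, dy, roll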

The high dynamic range unit 530 may receive, or otherwise access (e.g., from the front image signal processor 510), multiple images of a scene that have been captured with different exposure times. The high dynamic range unit 530 may combine the images captured with different exposure times to obtain a high dynamic range image. For example, the high dynamic range unit 530 may combine two images, a long exposure image and a short exposure image, to obtain a high dynamic range image. For example, image portions (e.g., pixels or blocks of pixels) of the high dynamic range image may be determined based on corresponding image portions of the short exposure image where the respective image portions of the long exposure image have saturated pixel values, and image portions of the high dynamic range image may otherwise be determined based on corresponding image portions of the long exposure image. In some implementations, motion compensation (e.g., local motion compensation by the local motion compensation unit 522 and/or global motion compensation by the global motion compensation unit 524) may be applied to either the long exposure image or the short exposure image to better align pixels corresponding to objects appearing in the field of view of the two input images. For example, the high dynamic range unit 530 may combine a long exposure image with a motion compensated short exposure image. For example, the high dynamic range unit 530 may combine a short exposure image with a motion compensated long exposure image. The high dynamic range unit 530 may receive, or otherwise access, the local motion prediction image, or a portion thereof, from the local motion compensation unit 522. The high dynamic range unit 530 may receive, or otherwise access, the global motion prediction image, or a portion thereof, from the global motion compensation unit 524.
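
As a minimal sketch of the combination rule described above (short-exposure content where the long exposure is saturated, long-exposure content elsewhere), the following Python example is offered for illustration only; the saturation level, the assumption of 10-bit RAW data, and the function name are assumptions, not details taken from the disclosure.

    import numpy as np

    def combine_hdr(long_exp, short_exp, exposure_ratio, sat_level=1023):
        """Combine a long-exposure frame with a (motion compensated)
        short-exposure frame. Where long-exposure pixels are saturated,
        substitute short-exposure pixels scaled by the exposure ratio;
        elsewhere keep the long-exposure pixels. Assumes 10-bit data."""
        long_f = long_exp.astype(np.float32)
        short_f = short_exp.astype(np.float32) * exposure_ratio
        saturated = long_exp >= sat_level
        return np.where(saturated, short_f, long_f)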

The high dynamic range unit 530 may output the high dynamic range image. For example, the high dynamic range unit 530 may output the high dynamic range image by storing the high dynamic range image in memory, such as shared memory, via the image signal processor bus 590, or the high dynamic range unit 530 may output the high dynamic range image directly to another unit of the image signal processor 500, such as the temporal noise reduction unit 542.

In some implementations, the high dynamic range unit 530 may be omitted, or high dynamic range processing by the high dynamic range unit 530 may be omitted.

The three-dimensional noise reduction unit 540 may include the temporal noise reduction (TNR) unit 542, the RAW to RAW (R2R) unit 544, or both.

The temporal noise reduction unit 542 may receive the current input frame, or one or more portions thereof, such as from the front image signal processor 510 or via the image signal processor bus 590. In some implementations, such as implementations implementing high dynamic range image processing, the temporal noise reduction unit 542 may receive the high dynamic range input frame, or one or more portions thereof, such as from the high dynamic range unit 530, as the current input frame.

The temporal noise reduction unit 542 may receive, or otherwise access, a local motion prediction frame from the local motion compensation unit 522. The temporal noise reduction unit 542 may receive, or otherwise access, the global motion prediction frame from the global motion compensation unit 524.

The temporal noise reduction unit 542 may reduce temporal noise in the current input frame, which may include recursively reducing temporal noise in a sequence of input images, such as a video. Recursive temporal noise reduction may include combining a current image from a sequence of images (e.g., a current frame from a video) with a recirculated image that is based on one or more previous images from the sequence of images to obtain a noise reduced image. Details of this combination (e.g., mixing weights for respective image portions) may be determined based on noise level information (e.g., a noise map) for the recirculated image.
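
For illustration, the following Python sketch shows one simple way such noise-map-driven mixing weights could be formed: the weight on the recirculated frame grows when the frame difference is small relative to the expected noise and shrinks when the difference looks like real motion. The weighting formula, the constant k, and the 0.9 clamp are illustrative assumptions, not the disclosed algorithm of the temporal noise reduction unit 542.

    import numpy as np

    def temporal_nr(current, recirculated, noise_map, k=2.0):
        """Recursive temporal noise reduction step: blend the current frame
        with the recirculated (previously denoised) frame using per-pixel
        mixing weights derived from a noise map (per-pixel noise variance)."""
        diff = np.abs(current.astype(np.float32) - recirculated.astype(np.float32))
        sigma = np.sqrt(np.maximum(noise_map, 1e-6))
        alpha = np.clip(1.0 - diff / (k * sigma), 0.0, 0.9)  # weight on recirculated
        return alpha * recirculated + (1.0 - alpha) * current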

The temporal noise reduction unit 542 may generate output including a pixel value and associated noise variance for the pixel value for one or more pixels of the noise reduced image (e.g., the noise reduced frame).

The RAW to RAW unit 544 may perform spatial denoising of frames of RAW images (e.g., frames of video). In some implementations, the RAW to RAW unit 544 may implement non-local means processing to reduce noise of a frame by determining weighted averages of pixels within the frame, where the weights depend on the similarity of the intensity or color between pixels. In some implementations, the weights may be based on noise variance values received from the temporal noise reduction unit 542. In some implementations, noise variance values for pixels of an image may be updated based on weights (e.g., non-local means weights) used to combine pixels of the image for spatial denoising, and these updated noise variance values may be recirculated. Spatial denoising (e.g., non-local means denoising) in the RAW to RAW unit 544 may include multiple passes of image signal processing, including passes at various resolutions.
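
The following Python sketch illustrates, for a single pixel, how non-local means weights can combine patch similarity with a noise variance value; it assumes the pixel is far enough from the image border for full patches, and the patch size, search window, and filtering constant h are illustrative assumptions rather than parameters of the RAW to RAW unit 544.

    import numpy as np

    def nlm_denoise_pixel(img, y, x, noise_var, patch=3, search=7, h=0.8):
        """Simplified non-local means for one pixel: average pixels in a
        search window, weighted by how similar their surrounding patches are
        to the patch around (y, x), normalized by the noise variance
        (noise_var must be > 0). Assumes (y, x) is away from the borders."""
        p, s = patch // 2, search // 2
        ref = img[y - p:y + p + 1, x - p:x + p + 1].astype(np.float32)
        weights, values = [], []
        for yy in range(y - s, y + s + 1):
            for xx in range(x - s, x + s + 1):
                cand = img[yy - p:yy + p + 1, xx - p:xx + p + 1].astype(np.float32)
                dist2 = np.mean((ref - cand) ** 2)
                w = np.exp(-max(dist2 - 2.0 * noise_var, 0.0) / (h * noise_var))
                weights.append(w)
                values.append(float(img[yy, xx]))
        weights = np.array(weights)
        return float((weights * np.array(values)).sum() / weights.sum())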

The RAW to YUV unit 550 may demosaic, and/or color process, the frames of RAW images, which may include representing each pixel in the YUV format, which may include a combination of a luminance (Y) component and two chrominance (UV) components.

The YUV to YUV unit 560 may perform local tone mapping of YUV images. In some implementations, the YUV to YUV unit 560 may include multi-scale local tone mapping using a single pass approach or a multi-pass approach on a frame at different scales.

The warp and blend unit 570 may warp images, blend images, or both. In some implementations, the warp and blend unit 570 may warp a corona around the equator of each frame to a rectangle. For example, the warp and blend unit 570 may warp a corona around the equator of each frame to a rectangle based on the corresponding low resolution frame generated by the front image signal processor 510.

The warp and blend unit 570 may apply one or more transformations to the frames. In some implementations, spherical images produced by a multi-face camera device, such as the image capture apparatus 110 shown in FIG. 1 or the image capture apparatus 300 shown in FIG. 3, may be warped and/or blended by the warp and blend unit 570 to correct for distortions at image edges. In some implementations, the warp and blend unit 570 may apply a transformation that is subject to a close to identity constraint, wherein a location of a pixel in an input image to the warp and blend unit 570 may be similar to, such as within a defined distance threshold of, a location of a corresponding pixel in an output image from the warp and blend unit 570. For example, the warp and blend unit 570 may include an internal memory, which may have a size, such as 100 lines, which may be smaller than a size of a frame, and the warp and blend unit 570 may process the input image data in raster-in/raster-out order using a transformation that is subject to a close to identity constraint.

In some implementations, the warp and blend unit 570 may apply a transformation that is independent of close to identity constraints, which may include processing the input image data in raster-in/dynamic-out or dynamic-in/raster-out order. For example, the warp and blend unit 570 may transform two or more non-rectilinear (fisheye) images to generate a combined frame, such as an equirectangular frame, by processing the input image data in raster-in/dynamic-out or dynamic-in/raster-out order.

The stitching cost unit 580 may generate a stitching cost map as an output. In some implementations, the cost map may be represented as a rectangle having disparity x and longitude y based on a warping. Each value of the cost map may be a cost function of a disparity x value for a corresponding longitude. Cost maps may be generated for various scales, longitudes, and disparities.

The scaler 585 may scale images received from the output of the warp and blend unit 570, which may be in patches, or blocks, of pixels, such as 16×16 blocks, 8×8 blocks, or patches or blocks of any other size or combination of sizes.

The image signal processor bus 590 may be a bus or interconnect, such as an on-chip interconnect or embedded microcontroller bus interface, for communication between the front image signal processor 510, the temporal noise reduction unit 542, the local motion compensation unit 522, the RAW to RAW unit 544, the RAW to YUV unit 550, the YUV to YUV unit 560, the warp and blend unit 570, the stitching cost unit 580, the scaler 585, the configuration controller 595, or any combination thereof.

The configuration controller 595 may coordinate image processing by the front image signal processor 510, the local motion estimation unit 520, the local motion compensation unit 522, the global motion compensation unit 524, the high dynamic range unit 530, the three-dimensional noise reduction unit 540, the temporal noise reduction unit 542, the RAW to RAW unit 544, the RAW to YUV unit 550, the YUV to YUV unit 560, the warp and blend unit 570, the stitching cost unit 580, the scaler 585, the image signal processor bus 590, or any combination thereof, of the image signal processor 500. For example, the configuration controller 595 may control camera alignment model calibration, auto-exposure, auto-white balance, or any other camera calibration or similar process or combination of processes. In some implementations, the configuration controller 595 may be a microcontroller. The configuration controller 595 is shown in FIG. 5 using broken lines to indicate that the configuration controller 595 may be included in the image signal processor 500 or may be external to, and in communication with, the image signal processor 500. The configuration controller 595 may include a respective clock, power domain, or both.

FIG. 6A is a block diagram of an example of a system 600 configured for image capture and denoising. The system 600 includes an image capture device 610 (e.g., a camera or a drone) that includes a processing apparatus 612 that is configured to receive a first image from a first image sensor 614 and receive a second image from a second image sensor 616. The processing apparatus 612 may be configured to perform image signal processing (e.g., denoising, stitching, and/or encoding) to generate composite images based on image data from the image sensors 614 and 616. The image capture device 610 includes a communications interface 618 for transferring images to other devices. The image capture device 610 includes a user interface 620, which may allow a user to control image capture functions and/or view images. The image capture device 610 includes a battery 622 for powering the image capture device 610. The components of the image capture device 610 may communicate with each other via a bus 624. The system 600 may be used to implement processes described in this disclosure, such as the process 900 of FIG. 9 and/or the process 1000 of FIG. 10.

The processing apparatus 612 may include one or more processors having single or multiple processing cores. The processing apparatus 612 may include memory, such as a random access memory (RAM) device, flash memory, or any other suitable type of storage device, such as a non-transitory computer readable memory. The memory of the processing apparatus 612 may include executable instructions and data that can be accessed by one or more processors of the processing apparatus 612. For example, the processing apparatus 612 may include one or more DRAM modules, such as double data rate synchronous dynamic random-access memory (DDR SDRAM). In some implementations, the processing apparatus 612 may include a digital signal processor (DSP). In some implementations, the processing apparatus 612 may include an application specific integrated circuit (ASIC). For example, the processing apparatus 612 may include an image signal processor.

The first image sensor 614 and the second image sensor 616 are configured to capture images. The first image sensor 614 and the second image sensor 616 are configured to detect light of a certain spectrum (e.g., the visible spectrum or the infrared spectrum) and convey information constituting an image as electrical signals (e.g., analog or digital signals). For example, the image sensors 614 and 616 may include charge-coupled devices (CCD) or active pixel sensors in complementary metal-oxide-semiconductor (CMOS). The image sensors 614 and 616 may detect light incident through respective lenses (e.g., fisheye lenses). In some implementations, the image sensors 614 and 616 include analog-to-digital converters. In some implementations, the image sensors 614 and 616 are held in a fixed orientation with respective fields of view that overlap. For example, the image sensors 614 and 616 may be configured to capture image data using a plurality of selectable exposure times.

The image capture device 610 may include the communications interface 618, which may enable communications with a personal computing device (e.g., a smartphone, a tablet, a laptop computer, or a desktop computer). For example, the communications interface 618 may be used to receive commands controlling image capture and processing in the image capture device 610. For example, the communications interface 618 may be used to transfer image data to a personal computing device. For example, the communications interface 618 may include a wired interface, such as a high-definition multimedia interface (HDMI), a universal serial bus (USB) interface, or a FireWire interface. For example, the communications interface 618 may include a wireless interface, such as a Bluetooth interface, a ZigBee interface, and/or a Wi-Fi interface.

The image capture device 610 may include the user interface 620. For example, the user interface 620 may include an LCD display for presenting images and/or messages to a user. For example, the user interface 620 may include a button or switch enabling a person to manually turn the image capture device 610 on and off. For example, the user interface 620 may include a shutter button for snapping pictures. In some implementations, the user interface 620 includes a touchscreen display. For example, the touchscreen display may be configured to present a user interface including a slider interface for selecting an adjustable field of view for the image capture device 610. For example, a field of view selection may be received via the slider interface. In some implementations, a low power preview mode may be terminated responsive to detection of a framing event based on touch data from the touchscreen display.

The image capture device 610 may include the battery 622 that powers the image capture device 610 and/or its peripherals. For example, the battery 622 may be charged wirelessly or through a micro-USB interface.

The image capture device 610 may include other components not shown in FIG. 6A. For example, the image capture device 610 may include one or more motion sensors, such as a gyroscope, an accelerometer, and/or a magnetometer. For example, a low power preview mode of the image capture device 610 may be terminated responsive to detection of a framing event based on angular rate data from the gyroscope.

FIG. 6B is a block diagram of an example of a system 630 configured for image capture and denoising. The system 630 includes an image capture device 640 that communicates via a communications link 650 with a personal computing device 660. The image capture device 640 includes a first image sensor 642 and a second image sensor 644 that are configured to capture respective images. The image capture device 640 includes a communications interface 646 configured to transfer images via the communication link 650 to the personal computing device 660. The personal computing device 660 includes a processing apparatus 662, a user interface 664, and a communications interface 666. The processing apparatus 662 is configured to receive, using the communications interface 666, a first image from the first image sensor 642, and receive a second image from the second image sensor 644. The processing apparatus 662 may be configured to perform image signal processing (e.g., denoising, stitching, and/or encoding) to generate composite images based on image data from the image sensors 642 and 644. The system 630 may be used to implement processes described in this disclosure, such as the process 900 of FIG. 9 and/or the process 1000 of FIG. 10.

The first image sensor 642 and the second image sensor 644 are configured to capture images. The first image sensor 642 and the second image sensor 644 are configured to detect light of a certain spectrum (e.g., the visible spectrum or the infrared spectrum) and convey information constituting an image as electrical signals (e.g., analog or digital signals). For example, the image sensors 642 and 644 may include charge-coupled devices (CCD) or active pixel sensors in complementary metal-oxide-semiconductor (CMOS). The image sensors 642 and 644 may detect light incident through respective lenses (e.g., fisheye lenses). In some implementations, the image sensors 642 and 644 include analog-to-digital converters. In some implementations, the image sensors 642 and 644 are held in a fixed relative orientation with respective fields of view that overlap. For example, the image sensors 642 and 644 may be configured to capture image data using a plurality of selectable exposure times. Image signals from the image sensors 642 and 644 may be passed to other components of the image capture device 640 via a bus 648.

The communications link 650 may be a wired communications link or a wireless communications link. The communications interface 646 and the communications interface 666 may enable communications over the communications link 650. For example, the communications interface 646 and the communications interface 666 may include a high-definition multimedia interface (HDMI), a universal serial bus (USB) interface, a FireWire interface, a Bluetooth interface, a ZigBee interface, and/or a Wi-Fi interface. For example, the communications interface 646 and the communications interface 666 may be used to transfer image data from the image capture device 640 to the personal computing device 660 for image signal processing (e.g., denoising, stitching, and/or encoding) to generate composite images based on image data from the image sensors 642 and 644.

The processing apparatus 662 may include one or more processors having single or multiple processing cores. The processing apparatus 662 may include memory, such as a random access memory (RAM) device, flash memory, or any other suitable type of storage device, such as a non-transitory computer readable memory. The memory of the processing apparatus 662 may include executable instructions and data that can be accessed by one or more processors of the processing apparatus 662. For example, the processing apparatus 662 may include one or more DRAM modules, such as double data rate synchronous dynamic random-access memory (DDR SDRAM). In some implementations, the processing apparatus 662 may include a digital signal processor (DSP). In some implementations, the processing apparatus 662 may include an application specific integrated circuit (ASIC). For example, the processing apparatus 662 may include an image signal processor. The processing apparatus 662 may exchange data (e.g., image data) with other components of the personal computing device 660 via the bus 668.

The personal computing device 660 may include the user interface 664. For example, the user interface 664 may include a touchscreen display for presenting images and/or messages to a user and receiving commands from a user. For example, the user interface 664 may include a button or switch enabling a person to manually turn the personal computing device 660 on and off. In some implementations, commands (e.g., start recording video, stop recording video, or snap photograph) received via the user interface 664 may be passed on to the image capture device 640 via the communications link 650. In some implementations, the user interface 664 includes a touchscreen display. For example, the touchscreen display may be configured to present a user interface including a slider interface for selecting an adjustable field of view for the image capture device 640. For example, a field of view selection may be received via the slider interface. In some implementations, a low power preview mode may be terminated responsive to detection of a framing event based on touch data from the touchscreen display.

FIG. 7 is a block diagram of an example of an image processing pipeline 700 with an adjustable field of view for capturing images. In this example, the image processing pipeline 700 implements smooth field of view adjustment, which may also be referred to as a digital zoom function, for encoding images with a 1080p encode resolution. The image processing pipeline 700 includes an image sensor 710, a RAW domain crop module 712, a RAW domain scale module 714, a YUV domain crop module 716, a YUV domain scale module 718, and an encoder 720. The image sensor 710 is configured to oversample to obtain an image 722 (e.g., a still image or a frame of video) at a capture resolution (e.g., 3840×2160 pixels) that is greater than an encode resolution (e.g., 1920×1080 pixels) used by the encoder 720. The captured image 722 may have a full field of view (FOV) for the image sensor 710, i.e., a largest available field of view using the image sensor (e.g., 118 degrees horizontal field of view (HFOV)). The image processing pipeline 700 may be configured to dynamically adjust the field of view of the images (e.g., 730, 760, and 790) that are passed to the encoder at the encode resolution (e.g., 1920×1080 pixels) for encoding. The dynamic adjustment of the field of view may be implemented by adjusting parameters of the RAW domain crop module 712, the RAW domain scale module 714, the YUV domain crop module 716, and/or the YUV domain scale module 718 to effect a change in field of view by cropping and/or downscaling the image, while avoiding upscaling the image. FIG. 7 illustrates three configurations of the image processing pipeline 700, corresponding to the three rows in FIG. 7, that are configured to realize three different field of view settings (e.g., 118 degrees HFOV, 85 degrees HFOV, and 62 degrees HFOV) taken as examples from a range of available field of view settings from a widest field of view (e.g., 118 degrees HFOV) to a narrowest field of view (e.g., 62 degrees HFOV). Smooth field of view adjustment or zoom is achieved by passing the oversampled image at the full capture resolution (e.g., 3840×2160 pixels) without scaling in a RAW domain, and cropping and down-scaling in a YUV domain to obtain an image at the encode resolution (e.g., 1920×1080 pixels) with the desired field of view.

In the example of the image processing pipeline 700, the field of view is dynamically adjusted by adjusting settings of the YUV domain crop module 716 to crop to a desired field of view and adjusting settings of the YUV domain scale module 718 in a corresponding manner to perform any down-scaling to keep the encode resolution for images passed to the encoder 720 consistent. For example, the image processing pipeline 700 may have sufficient computational bandwidth and may lack a scaling ratio limitation, which may allow all cropping and scaling to be performed in the YUV domain after initial processing (e.g., correction of dead pixels, band processing, decoupling of vertical blanking, and/or noise reduction processing) is performed in the RAW domain at the higher capture resolution (e.g., 3840×2160 pixels). By avoiding upscaling, image quality of the encoded images may be enhanced compared to conventional zoom or field of view adjustment systems that use upscaling, since upscaling may introduce significant blurring or other distortion to images. For example, the image processing pipeline 700 may be included in the image capture device 610 of FIG. 6A. For example, the image processing pipeline 700 may be included in the system 630 of FIG. 6B. In some implementations, the image processing pipeline 700 may be included in an image capture apparatus, such as the image capture apparatus 110 shown in FIG. 1 or the image capture apparatus 300 shown in FIG. 3. For example, the image processing pipeline 700 may be used to implement the process 900 of FIG. 9.
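
As a worked illustration of the arithmetic behind this YUV-domain-only approach, the following Python sketch derives a centered, aspect-ratio-preserving crop window and the resulting down-scale factor from a crop width that corresponds to the selected field of view. The mapping from an HFOV in degrees to a crop width depends on the lens projection and is assumed to be supplied from elsewhere; the function name and the refusal to upscale are illustrative choices, not the disclosed implementation.

    def yuv_crop_and_scale(capture_w, capture_h, crop_w,
                           encode_w=1920, encode_h=1080):
        """Derive a centered YUV-domain crop window and down-scale factor for
        a crop width corresponding to the selected field of view, refusing
        any setting that would require upscaling."""
        crop_h = crop_w * encode_h // encode_w      # preserve the 16:9 aspect ratio
        if crop_w < encode_w or crop_h < encode_h:
            raise ValueError("requested field of view would require upscaling")
        x0 = (capture_w - crop_w) // 2
        y0 = (capture_h - crop_h) // 2
        scale = encode_w / crop_w                   # 1.0 means pass-through
        return (x0, y0, crop_w, crop_h), scale

    # 85 degree HFOV example from the second row of FIG. 7:
    window, scale = yuv_crop_and_scale(3840, 2160, 2704)
    # window == (568, 319, 2704, 1521), scale ~= 0.71; the example in the text
    # rounds the crop height to 1520 pixels.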

The image processing pipeline 700 includes an image sensor 710 that is used to oversample to obtain an image 722 (e.g., a frame of video) at a capture resolution (e.g., 3840×2160 pixels) that is greater than an encode resolution (e.g., 1920×1080 pixels) used by the encoder 720. For example, the image sensor 710 may include the image sensor 230, the image sensor 614, or the image sensor 642. In some implementations, the image 722 is passed from the image sensor 710 to an image signal processor for processing prior to encode. For example, the processing performed by the image signal processor may include cropping the image using a crop setting to obtain a cropped image (e.g., 728, 758, or 788), and down-scaling the cropped image to obtain a scaled image (e.g., 730, 760, or 790) at the encode resolution.

The RAW domain crop module 712 is disabled or configured to pass the image 722 without changing the field of view or the resolution. The RAW domain scale module 714 is disabled or configured to pass the image 722 without changing the field of view or the resolution. The image 722 that is output from the RAW domain scale module 714 and input to the YUV domain crop module (716, 746, and 776) may be subject to intervening processing not impacting the size of the image 722. For example, the image 722 is subject to RAW to YUV conversion (e.g., using the RAW to YUV unit 550) and may be subject to additional image processing (e.g., image processing described in relation to the image signal processor 500). The YUV domain crop module (716, 746, and 776) is configured to allow dynamic adjustment of one or more cropping settings to adjust a field of view of the image 722 by cropping the image 722. The YUV domain scale module (718, 748, and 778) is configured to allow dynamic adjustment of one or more scaling settings to adjust a resolution of the cropped image by down-scaling the cropped image to the encode resolution.

In a first configuration, corresponding to the first row in FIG. 7, the YUV domain crop module 716 is configured to pass the image 722 (e.g., 118 degrees HFOV and 3840×2160 pixels) without changing the field of view or the resolution. The YUV domain scale module 718 is configured to down-scale the image 722 to obtain a scaled image 730 with the widest field of view at the encode resolution (e.g., 118 degrees HFOV and 1920×1080 pixels). The scaled image 730 is passed to the encoder 720 for encoding (e.g., JPEG, MPEG, or VP9 encoding).

In a second configuration, corresponding to the second row in FIG. 7, the YUV domain crop module 746 is configured with one or more crop settings to crop the image 722 to obtain a cropped image 758 with a field of view reduced to an intermediate field of view (e.g., 85 degrees HFOV and 2704×1520 pixels). The YUV domain scale module 748 is configured to down-scale the cropped image 758 to obtain a scaled image 760 with the intermediate field of view at the encode resolution (e.g., 85 degrees HFOV and 1920×1080 pixels). The scaled image 760 is passed to the encoder 720 for encoding (e.g., JPEG, MPEG, or VP9 encoding).

In a third configuration, corresponding to the third row in FIG. 7, the YUV domain crop module 776 is configured with one or more crop settings to crop the image 722 to obtain a cropped image 788 with a field of view reduced to the narrowest field of view (e.g., 62 degrees HFOV and 1920×1080 pixels). The YUV domain scale module 778 is configured to down-scale the cropped image 788 to obtain a scaled image 790 with the narrowest field of view at the encode resolution (e.g., 62 degrees HFOV and 1920×1080 pixels). The scaled image 790 is passed to the encoder 720 for encoding (e.g., JPEG, MPEG, or VP9 encoding).

For example, the encoder 720 may include the encoder 420 of FIG. 4.

Other configurations (not shown in FIG. 7) of the YUV domain crop module and the YUV domain scale module may be used to dynamically obtain images at additional intermediate fields of view along the range between the widest available field of view and the narrowest available field of view.

In some implementations, a combination of RAW domain cropping and/or scaling and YUV domain cropping and/or scaling is used to obtain images with a desired field of view at an encode resolution, while avoiding up-scaling, to achieve smooth field of view adjustment/zoom. For example, a system may have a scaling ratio limitation in the YUV domain. For example, a YUV down-scaling ratio may be constrained to be less than 1.75×, meaning the incoming resolution can be up to 1.75 times the outgoing resolution.
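
To illustrate how such a ratio limit motivates a two-stage split, the following Python sketch checks that a given RAW/YUV division of the total down-scale avoids upscaling while keeping the YUV stage within its limit; the function name and the assertion-based style are illustrative, and the widths are the example values used in this description.

    def check_two_stage_scaling(capture_w, intermediate_w, encode_w,
                                max_yuv_ratio=1.75):
        """Verify a split of the down-scale across the RAW and YUV domains:
        no upscaling in either stage, and the YUV stage within its ratio
        limit. With the example figures, 3840 -> 2704 in RAW is about 1.42x
        and 2704 -> 1920 in YUV is about 1.41x, under the 1.75x limit."""
        raw_ratio = capture_w / intermediate_w
        yuv_ratio = intermediate_w / encode_w
        assert raw_ratio >= 1.0 and yuv_ratio >= 1.0, "no upscaling allowed"
        assert yuv_ratio <= max_yuv_ratio, "YUV scaler ratio limit exceeded"
        return raw_ratio, yuv_ratio

    print(check_two_stage_scaling(3840, 2704, 1920))  # (~1.42, ~1.41)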

FIG. 8 is a block diagram of an example of an image processing pipeline 800 with an adjustable field of view for capturing images with a limited YUV domain down-scaling ratio. In this example, the image processing pipeline 800 implements smooth field of view adjustment, which may also be referred to as a digital zoom function, for encoding images with a 1080p encode resolution. The image processing pipeline 800 includes an image sensor 810, a RAW domain crop module 812, a RAW domain scale module 814, a YUV domain crop module 816, a YUV domain scale module 818, and an encoder 820. The image sensor 810 is configured to oversample to obtain an image 822 (e.g., a still image or a frame of video) at a capture resolution (e.g., 3840×2160 pixels) that is greater than an encode resolution (e.g., 1920×1080 pixels) used by the encoder 820. The captured image 822 may have a full field of view (FOV) for the image sensor 810, i.e., a largest available field of view using the image sensor (e.g., 118 degrees horizontal field of view (HFOV)). The image processing pipeline 800 may be configured to dynamically adjust the field of view of the images (e.g., 830, 860, and 890) that are passed to the encoder at the encode resolution (e.g., 1920×1080 pixels) for encoding. The dynamic adjustment of the field of view may be implemented by adjusting parameters of the RAW domain crop module 812, the RAW domain scale module 814, the YUV domain crop module 816, and/or the YUV domain scale module 818 to effect a change in field of view by cropping and/or downscaling the image, while avoiding upscaling the image. FIG. 8 illustrates three configurations of the image processing pipeline 800, corresponding to the three rows in FIG. 8, that are configured to realize three different field of view settings (e.g., 118 degrees HFOV, 85 degrees HFOV, and 62 degrees HFOV) taken as examples from a range of available field of view settings from a widest field of view (e.g., 118 degrees HFOV) to a narrowest field of view (e.g., 62 degrees HFOV). Smooth field of view adjustment or zoom is achieved by cropping the oversampled image at the full capture resolution (e.g., 3840×2160 pixels) and down-scaling in a RAW domain, and/or cropping and down-scaling in a YUV domain, to obtain an image at the encode resolution (e.g., 1920×1080 pixels) with the desired field of view.

In the example of the image processing pipeline 800, the field of view is dynamically adjusted by adjusting settings of the RAW domain crop module 812 to crop to a desired field of view and adjusting settings of the RAW domain scale module 814 in a corresponding manner to perform any down-scaling to keep an intermediate resolution for images passed to downstream processing of the image processing pipeline 800 (including the YUV domain scale module 818) consistent, and/or adjusting settings of the YUV domain crop module 816 to crop to a desired field of view and adjusting settings of the YUV domain scale module 818 in a corresponding manner to perform any down-scaling to keep the encode resolution for images passed to the encoder 820 consistent. For example, the image processing pipeline 800 may have computational bandwidth limitations and/or may have a YUV scaling ratio limitation, which may be overcome by performing some cropping and scaling in the RAW domain before initial image processing (e.g., correction of dead pixels, band processing, decoupling of vertical blanking, and/or noise reduction processing) is performed in the RAW domain at the intermediate resolution (e.g., 2704×1520 pixels). By avoiding upscaling, image quality of the encoded images may be enhanced compared to conventional zoom or field of view adjustment systems that use upscaling, since upscaling may introduce significant blurring or other distortion to images. For example, the image processing pipeline 800 may be included in the image capture device 610 of FIG. 6A. For example, the image processing pipeline 800 may be included in the system 630 of FIG. 6B. In some implementations, the image processing pipeline 800 may be included in an image capture apparatus, such as the image capture apparatus 110 shown in FIG. 1 or the image capture apparatus 300 shown in FIG. 3. For example, the image processing pipeline 800 may be used to implement the process 900 of FIG. 9. For example, the image processing pipeline 800 may be used to implement the process 1000 of FIG. 10.

The image processing pipeline 800 includes an image sensor 810 that is used to oversample to obtain an image 822 (e.g., a frame of video) at a capture resolution (e.g., 3840×2160 pixels) that is greater than an encode resolution (e.g., 1920×1080 pixels) used by the encoder 820. For example, the image sensor 810 may include the image sensor 230, the image sensor 614, or the image sensor 642. In some implementations, the image 822 is passed from the image sensor 810 to an image signal processor for processing prior to encode. For example, the processing performed by the image signal processor may include cropping the image using a crop setting to obtain a cropped image (e.g., 856 or 888), and down-scaling the cropped image to obtain a scaled image (e.g., 830, 860, or 890) at the encode resolution.

The RAW domain crop module (812 and 842) is configured to allow dynamic adjustment of one or more cropping settings to adjust a field of view of the image 822 by cropping the image 822. The RAW domain scale module (814 and 844) is configured to allow dynamic adjustment of one or more scaling settings to adjust a resolution of the cropped image by down-scaling the cropped image to an intermediate resolution (e.g., 2704×1520 pixels) that satisfies a constraint on the resolution supported by downstream components of the image processing pipeline 800, including the YUV domain scale module 818. The YUV domain crop module (816 and 876) is configured to allow dynamic adjustment of one or more cropping settings to adjust a field of view of the image (826 or 856) by cropping the image. The YUV domain scale module (818 and 878) is configured to allow dynamic adjustment of one or more scaling settings to adjust a resolution of the cropped image by down-scaling the cropped image to the encode resolution (e.g., 1920×1080 pixels).

In a first configuration, corresponding to the first row in FIG. 8, the RAW domain crop module 812 is disabled or configured to pass the image 822 without changing the field of view or the resolution. The RAW domain scale module 814 is configured to down-scale the image 822 to obtain a RAW scaled image 826 at the intermediate resolution. The image 826 that is output from the RAW domain scale module 814 and input to the YUV domain crop module 816 may be subject to intervening processing not impacting the size of the image. For example, the image 826 is subject to RAW to YUV conversion (e.g., using the RAW to YUV unit 550) and may be subject to additional image processing (e.g., image processing described in relation to the image signal processor 500). The YUV domain crop module 816 is configured to pass the image 826 (e.g., 118 degrees HFOV and 2704×1520 pixels) without changing the field of view or the resolution. The YUV domain scale module 818 is configured to down-scale the image 826 to obtain a scaled image 830 with the widest field of view at the encode resolution (e.g., 118 degrees HFOV and 1920×1080 pixels). The scaled image 830 is passed to the encoder 820 for encoding (e.g., JPEG, MPEG, or VP9 encoding).

In a second configuration, corresponding to the second row in FIG. 8, the RAW domain crop module 842 is configured with one or more crop settings to crop the image 822 to obtain a RAW cropped image 854 with a field of view reduced to an intermediate field of view (e.g., 85 degrees HFOV and 2704×1520 pixels). The RAW domain scale module 844 is configured to down-scale the RAW cropped image 854 to obtain a RAW scaled image 856 at the intermediate resolution (e.g., 85 degrees HFOV and 2704×1520 pixels). The image 856 that is output from the RAW domain scale module 844 and input to the YUV domain crop module 816 may be subject to intervening processing not impacting the size of the image. For example, the image 856 is subject to RAW to YUV conversion (e.g., using the RAW to YUV unit 550) and may be subject to additional image processing (e.g., image processing described in relation to the image signal processor 500). The YUV domain crop module 816 is configured to pass the image 856 (e.g., 85 degrees HFOV and 2704×1520 pixels) without changing the field of view or the resolution. The YUV domain scale module 818 is configured to down-scale the image 856 to obtain a scaled image 860 with the intermediate field of view at the encode resolution (e.g., 85 degrees HFOV and 1920×1080 pixels). The scaled image 860 is passed to the encoder 820 for encoding (e.g., JPEG, MPEG, or VP9 encoding).

In a third configuration, corresponding to the third row in FIG. 8, the RAW domain crop module 842 is configured with one or more crop settings to crop the image 822 to obtain a RAW cropped image 854 with a field of view reduced to an intermediate field of view (e.g., 85 degrees HFOV and 2704×1520 pixels). The RAW domain scale module 844 is configured to down-scale the RAW cropped image 854 to obtain a RAW scaled image 856 at the intermediate resolution (e.g., 85 degrees HFOV and 2704×1520 pixels). The image 856 that is output from the RAW domain scale module 844 and input to the YUV domain crop module 876 may be subject to intervening processing not impacting the size of the image. For example, the image 856 is subject to RAW to YUV conversion (e.g., using the RAW to YUV unit 550) and may be subject to additional image processing (e.g., image processing described in relation to the image signal processor 500). The YUV domain crop module 876 is configured with one or more crop settings to crop the RAW scaled image 856 to obtain a cropped image 888 with a field of view reduced to a narrowest field of view (e.g., 62 degrees HFOV and 1920×1080 pixels). The YUV domain scale module 878 is configured to down-scale the cropped image 888 to obtain a scaled image 890 with the narrowest field of view at the encode resolution (e.g., 62 degrees HFOV and 1920×1080 pixels). The scaled image 890 is passed to the encoder 820 for encoding (e.g., JPEG, MPEG, or VP9 encoding).

For example, the encoder 820 may include the encoder 420 of FIG. 4.

For example, while zooming/changing the field of view between 118 degrees HFOV and 85 degrees HFOV, a field of view selection (e.g., a zoom ratio) coming from a user interface may be converted to a crop setting (e.g., a configuration parameter specifying a crop resolution). A RAW crop block and a RAW scaler may be programmed dynamically to implement the field of view adjustment. For example, the field of view adjustment may be implemented by dynamically switching from the first row to the second row of FIG. 8 and continuing to use the flow of the second row until an 85 degree HFOV is achieved.

For example, while zooming/changing the field of view between 85 degrees HFOV and 62 degrees HFOV, a field of view selection (e.g., a zoom ratio) coming from a user interface may be converted to a crop setting (e.g., a configuration parameter specifying a crop resolution). A YUV crop block and a YUV scaler may be programmed dynamically to implement the field of view adjustment. For example, the field of view adjustment may be implemented by dynamically switching from the second row to the third row of FIG. 8 and continuing to use the flow of the third row until a 62 degree HFOV is achieved.
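
Purely for illustration, the following Python sketch shows one way a controller might select which crop/scale blocks to program as the field of view selection moves across the example breakpoints described above (118, 85, and 62 degrees HFOV). The function name, the returned dictionary, and the clamping behavior are assumptions introduced here; only the degree breakpoints and resolutions come from the example in this description.

    def configure_pipeline(target_hfov_deg):
        """Map a field of view selection to the crop/scale blocks that are
        programmed for each row of the FIG. 8 example (values illustrative)."""
        if target_hfov_deg >= 118:                                 # first row
            return {"raw_crop": None, "raw_scale": "to 2704x1520",
                    "yuv_crop": None, "yuv_scale": "to 1920x1080"}
        if target_hfov_deg > 85:                                   # second row
            return {"raw_crop": f"to {target_hfov_deg} deg HFOV",
                    "raw_scale": "to 2704x1520",
                    "yuv_crop": None, "yuv_scale": "to 1920x1080"}
        return {"raw_crop": "to 85 deg HFOV",                      # third row
                "raw_scale": "to 2704x1520",
                "yuv_crop": f"to {target_hfov_deg} deg HFOV",
                "yuv_scale": "to 1920x1080"}

    print(configure_pipeline(70))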

Other configurations (not shown in FIG. 8) of the YUV domain crop module and the YUV domain scale module may be used to dynamically obtain images at additional intermediate fields of view along the range between the widest available field of view and the narrowest available field of view.

FIG. 9 is a flowchart of an example of a process 900 for adjusting a field of view for an image capture device. The process includes receiving 910 a field of view selection; oversampling 920 to obtain an image at a capture resolution that is greater than an encode resolution; determining 930 a crop setting based on the field of view selection; cropping 940 the image using the crop setting to obtain a cropped image; down-scaling 950 the cropped image to obtain a scaled image at the encode resolution; encoding 960 the scaled image at the encode resolution; and storing, displaying, or transmitting 970 an output image based on the scaled image. For example, the process 900 may be implemented by the system 600 of FIG. 6A or the system 630 of FIG. 6B. For example, the process 900 may be implemented by an image capture device, such as the image capture device 610 shown in FIG. 6A, or an image capture apparatus, such as the image capture apparatus 110 shown in FIG. 1 or the image capture apparatus 300 of FIG. 3. For example, the process 900 may be implemented by a personal computing device, such as the personal computing device 660. For example, the process 900 may be implemented using a processing apparatus (e.g., the processing apparatus 612) that includes an image signal processor (e.g., the image signal processor 500). For example, the process 900 may be implemented by the image processing pipeline 700 of FIG. 7. For example, the process 900 may be implemented by the image processing pipeline 800 of FIG. 8.

The process 900 includes receiving 910 a field of view selection. For example, the field of view selection may be received 910 via a user interface (e.g., the user interface unit 212, the user interface 620, or the user interface 664). In some implementations, a slider interface may be presented via a touchscreen display, and the field of view selection is received 910 via the touchscreen display based on a user interaction with the slider interface. In some implementations, the field of view selection may be received 910 via a communications link (e.g., the communications link 650). For example, the field of view selection may be received 910 via a drop down menu presented in a user interface. For example, the field of view selection may be received 910 as text entered in a user interface. For example, the field of view selection may be received 910 via a wireless or wired communications interface (e.g., Wi-Fi, Bluetooth, USB, HDMI, Wireless USB, Near Field Communication (NFC), Ethernet, a radio frequency transceiver, and/or other interfaces). For example, the field of view selection may be received 910 via the communications interface 666.

The process 900 includes oversampling 920, using an image sensor, to obtain an image (e.g., a still image or a frame of video) at a capture resolution that is greater than an encode resolution. For example, the capture resolution may be 3840×2160 pixels and the encode resolution may be 1920×1080 pixels, which may enable high quality digital zoom up to 2× without up-scaling. The image sensor may be part of an image capture apparatus (e.g., the image capture apparatus 110, the image capture apparatus 300, the image capture device 610, or the image capture device 640). For example, the image may be received from the image sensor via a bus (e.g., the bus 624 or the image signal processor bus 590). For example, the image may be received via a front ISP (e.g., the front ISP 510) that performs some initial processing on the received image. For example, the image may represent each pixel value in a defined format, such as in a RAW image signal format. For example, the image may be stored in a format using the Bayer color mosaic pattern.

The process 900 includes determining 930 a crop setting based on the field of view selection. For example, the crop setting may be for a RAW domain crop. For example, the crop setting may be for a YUV domain crop. In some implementations, multiple crop settings are determined that specify a RAW domain crop and a YUV domain crop that jointly achieve a field of view specified by the field of view selection. For example, a crop setting may specify a portion of a horizontal field of view that will be cropped out or a portion of a vertical field of view that will be cropped out. For example, a crop setting may explicitly or implicitly specify a crop operation that will maintain the aspect ratio of the image.

The process 900 includes cropping 940 the image using the crop setting to obtain a cropped image. For example, cropping 940 the image using the crop setting may include cropping the image in a RAW domain, before conversion to a YUV domain. For example, cropping 940 the image using the crop setting may include cropping the image in a YUV domain, after conversion from a RAW domain. In some implementations, pixels around the edges of the image are cropped 940 out to retain a portion in the center of the image corresponding to a narrower field of view. The aspect ratio of the image may be preserved by the cropping 940. For example, the process 1000 of FIG. 10 may be implemented to determine 930 the crop setting and crop 940 the image using the crop setting.
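
As a minimal illustration of the centered, aspect-ratio-preserving crop described above, the following Python sketch retains a window at the center of the image and discards the pixels around its edges; the function name and the array representation are illustrative assumptions rather than details of a particular implementation.

    import numpy as np

    def center_crop(image, crop_w, crop_h):
        """Crop a centered crop_w x crop_h window from an image stored as a
        (height, width, ...) array, discarding pixels around the edges."""
        h, w = image.shape[:2]
        if crop_w > w or crop_h > h:
            raise ValueError("crop larger than image")
        x0 = (w - crop_w) // 2
        y0 = (h - crop_h) // 2
        return image[y0:y0 + crop_h, x0:x0 + crop_w]

    frame = np.zeros((2160, 3840, 3), dtype=np.uint16)   # oversampled capture
    cropped = center_crop(frame, 2704, 1520)             # intermediate field of view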

The process 900 includes down-scaling 950 the cropped image to obtain a scaled image at the encode resolution (e.g., 1920×1080 pixels). For example, down-scaling 950 the cropped image may include down-scaling the cropped image in a YUV domain, after conversion from a RAW domain. For example, down-scaling 950 the cropped image may include down-scaling the cropped image in a RAW domain, before conversion to a YUV domain. For example, an image signal processor may be used to crop 940 the image using the crop setting to obtain a cropped image, and down-scale 950 the cropped image to obtain a scaled image at the encode resolution.

The process 900 includes encoding 960 the scaled image at the encode resolution (e.g., 1920×1080 pixels) to obtain an encoded image. For example, encoding 960 the scaled image may include performing a JPEG encoding, an MPEG encoding, or a VP9 encoding. For example, the encoder 420 of FIG. 4 may be used to encode 960 the scaled image at the encode resolution.

The process 900 includes storing, displaying, or transmitting 970 an output image based on the scaled image. For example, the output image may be a preview image that is displayed 970 on a display of an image capture device (e.g., a display of the user interface unit 212, a display of the user interface 620, or a display of the user interface 664). For example, the output image may be the encoded image. For example, the process 900 may include storing or transmitting 970 the encoded image. For example, the output image (e.g., the encoded image) may be transmitted 970 to an external device (e.g., a personal computing device) for display or storage. For example, the output image may be transmitted 970 via the communications interface 618. For example, the output image (e.g., the encoded image) may be stored 970 in memory of the processing apparatus 612 or in memory of the processing apparatus 662.

Performing the process 900 on an image capture device may consume more power than a low power preview mode of the device. For example, images from the image sensor may be up-scaled during the low power preview mode, which may enable image processing of the images at lower resolutions to consume less power. It may be advantageous to run in the low power preview mode when the image capture device is not actively being used for image capture. In some implementations, the process 900 may be initiated when a low power preview mode is terminated responsive to the detection of an event indicating active use of the image capture device. For example, a low power preview mode may be terminated responsive to detection of a framing event based on touch data from a touchscreen display (e.g., a touchscreen display of the user interface unit 212, the user interface 620, or the user interface 664). In some implementations, an image capture device may include a gyroscope, and the low power preview mode may be terminated responsive to detection of a framing event based on angular rate data from the gyroscope.

FIG. 10 is a flowchart of an example of a process 1000 for determining crop settings based on a field of view selection. The process 1000 includes determining 1010 a first crop setting for a RAW domain crop based on the field of view selection; cropping 1020 the image in a RAW domain; converting 1030 the cropped image from the RAW domain to a YUV domain; determining 1040 a second crop setting for a YUV domain crop based on the field of view selection and the first crop setting; and cropping 1050 the image in the YUV domain using the second crop setting. For example, the process 1000 may be implemented by the system 600 of FIG. 6A or the system 630 of FIG. 6B. For example, the process 1000 may be implemented by an image capture device, such as the image capture device 610 shown in FIG. 6A, or an image capture apparatus, such as the image capture apparatus 110 shown in FIG. 1 or the image capture apparatus 300 of FIG. 3. For example, the process 1000 may be implemented by a personal computing device, such as the personal computing device 660. For example, the process 1000 may be implemented using a processing apparatus (e.g., the processing apparatus 612) that includes an image signal processor (e.g., the image signal processor 500). For example, the process 1000 may be implemented by the image processing pipeline 800 of FIG. 8.

The process 1000 includes determining 1010 a first crop setting for a RAW domain crop based on the field of view selection. For example, the first crop setting may be set to a value that will reduce an image to an intermediate resolution (e.g., 2704×1520 pixels) that is supported by an image processing pipeline (e.g., an image processing pipeline implemented with an image signal processor), where cropping to achieve a field of view specified by the field of view selection would reduce the resolution of the image below the intermediate resolution.
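
A non-limiting numerical sketch of this determination follows; the capture resolution, the parameterization of the field of view selection as a scale factor, and the function name are assumptions made for illustration.

    CAPTURE = (4000, 2250)       # hypothetical oversampled capture resolution (width, height)
    INTERMEDIATE = (2704, 1520)  # example intermediate resolution supported by the pipeline

    def first_crop_setting(fov_scale: float) -> tuple[int, int]:
        """Return (width, height) for the RAW domain crop.

        fov_scale is assumed to lie in (0, 1]: 1.0 keeps the full capture
        width, and smaller values narrow the field of view.
        """
        target_width = round(CAPTURE[0] * fov_scale)
        target_height = round(CAPTURE[1] * fov_scale)
        # Do not crop below the intermediate resolution in the RAW domain;
        # any remaining narrowing is deferred to the YUV domain crop.
        return (max(target_width, INTERMEDIATE[0]), max(target_height, INTERMEDIATE[1]))

    # Example: a selection of roughly half the capture width stops at the
    # intermediate resolution, since 2000x1125 would fall below 2704x1520.
    assert first_crop_setting(0.5) == (2704, 1520)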

The process 1000 includes cropping 1020 the image in a RAW domain, before conversion to a YUV domain. The image may be cropped 1020 using the first crop setting. In some implementations, pixels around the edges of the image are cropped 1020 out to retain a portion in the center of the image corresponding to a narrower field of view. The aspect ratio of the image may be preserved by the cropping 1020.

The process 1000 includes converting 1030 the cropped image from the RAW domain to a YUV domain. For example, a linear transformation (e.g., a matrix multiplication) may be applied to the pixel values of the cropped image to convert 1030 the cropped image from the RAW domain (e.g., an RGB representation) to the YUV domain.
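
For illustration only, the linear transformation might be a 3×3 matrix multiplication applied per pixel; the BT.601 coefficients below are an assumption made for this sketch, since the disclosure requires only that some linear transformation be applied.

    import numpy as np

    # One possible RGB -> YUV matrix (BT.601, full range); chosen here as an example.
    RGB_TO_YUV = np.array([
        [ 0.299,     0.587,     0.114   ],
        [-0.14713,  -0.28886,   0.436   ],
        [ 0.615,    -0.51499,  -0.10001 ],
    ])

    def rgb_to_yuv(image: np.ndarray) -> np.ndarray:
        """Apply the linear transform to every pixel of an H x W x 3 RGB array."""
        return image.astype(np.float32) @ RGB_TO_YUV.T.astype(np.float32)

    yuv = rgb_to_yuv(np.zeros((1520, 2704, 3), dtype=np.uint16))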

The process 1000 includes determining 1040 a second crop setting for a YUV domain crop based on the field of view selection and the first crop setting. For example, where the image has been cropped 1020 in the RAW domain using the first crop setting to an intermediate resolution (e.g., 2704×1520 pixels) and image processing, including the conversion to the YUV domain and possible additional processing (e.g., noise reduction and/or local tone mapping), is performed on the image data at the intermediate resolution, the second crop setting may be determined 1040 to reduce the field of view of the image further to a field of view specified by the field of view selection.
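
Continuing the non-limiting numerical sketch above, the second crop setting can be derived from the field of view selection and the first crop setting; the names and the scale-factor parameterization remain assumptions made for illustration.

    def second_crop_setting(fov_scale: float,
                            capture: tuple[int, int],
                            first_crop: tuple[int, int],
                            intermediate: tuple[int, int]) -> tuple[int, int]:
        """Return (width, height) for the YUV domain crop."""
        # Size of the selected field of view in original sensor pixels.
        target_width = round(capture[0] * fov_scale)
        target_height = round(capture[1] * fov_scale)
        # The YUV image sits at the intermediate resolution but covers the field
        # of view retained by the first (RAW domain) crop, so the remaining
        # narrowing is expressed relative to that crop.
        width = round(intermediate[0] * target_width / first_crop[0])
        height = round(intermediate[1] * target_height / first_crop[1])
        return (width, height)

    # Example: with the values used above, the YUV crop carries out the
    # narrowing that the RAW domain crop deferred.
    assert second_crop_setting(0.5, (4000, 2250), (2704, 1520), (2704, 1520)) == (2000, 1125)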

The process 1000 includes cropping 1050 the image in the YUV domain using the second crop setting, after conversion from the RAW domain. The image may be cropped 1050 using the second crop setting. In some implementations, pixels around the edges of the image are cropped 1050 out to retain a portion in the center of the image corresponding to a narrower field of view that is specified by the field of view selection. The aspect ratio of the image may be preserved by the cropping 1050.

Where certain elements of these implementations may be partially or fully implemented using known components, those portions of such known components that are necessary for an understanding of the present disclosure have been described, and detailed descriptions of other portions of such known components have been omitted so as not to obscure the disclosure.

In the present specification, an implementation showing a singular component should not be considered limiting; rather, the disclosure is intended to encompass other implementations including a plurality of the same component, and vice-versa, unless explicitly stated otherwise herein.

Further, the present disclosure encompasses present and future known equivalents to the components referred to herein by way of illustration.

As used herein, the term “bus” is meant generally to denote any type of interconnection or communication architecture that may be used to communicate data between two or more entities. The “bus” could be optical, wireless, infrared or another type of communication medium. The exact topology of the bus could be, for example, standard “bus,” hierarchical bus, network-on-chip, address-event-representation (AER) connection, or other type of communication topology used for accessing, e.g., different memories in a system.

As used herein, the terms “computer,” “computing device,” and “computerized device” include, but are not limited to, personal computers (PCs) and minicomputers (whether desktop, laptop, or otherwise), mainframe computers, workstations, servers, personal digital assistants (PDAs), handheld computers, embedded computers, programmable logic devices, personal communicators, tablet computers, portable navigation aids, Java 2 Platform, Micro Edition (J2ME) equipped devices, cellular telephones, smart phones, personal integrated communication or entertainment devices, or literally any other device capable of executing a set of instructions.

As used herein, the term “computer program” or “software” is meant to include any sequence of machine cognizable steps which perform a function. Such program may be rendered in virtually any programming language or environment including, for example, C/C++, C#, Fortran, COBOL, MATLAB™, PASCAL, Python, assembly language, markup languages (e.g., HTML, Standard Generalized Markup Language (SGML), XML, Voice Markup Language (VoxML)), as well as object-oriented environments such as the Common Object Request Broker Architecture (CORBA), Java™ (including J2ME, Java Beans), and/or Binary Runtime Environment (e.g., Binary Runtime Environment for Wireless (BREW)).

As used herein, the terms “connection,” “link,” “transmission channel,” “delay line,” and “wireless” mean a causal link between any two or more entities (whether physical or logical/virtual) which enables information exchange between the entities.

As used herein, the terms “integrated circuit,” “chip,” and “IC” are meant to refer to an electronic circuit manufactured by the patterned diffusion of trace elements into the surface of a thin substrate of semiconductor material. By way of non-limiting example, integrated circuits may include field programmable gate arrays (e.g., FPGAs), a programmable logic device (PLD), reconfigurable computer fabrics (RCFs), systems on a chip (SoC), application-specific integrated circuits (ASICs), and/or other types of integrated circuits.

As used herein, the term “memory” includes any type of integrated circuit or other storage device adapted for storing digital data, including, without limitation, read-only memory (ROM), programmable ROM (PROM), electrically erasable PROM (EEPROM), dynamic random access memory (DRAM), Mobile DRAM, synchronous DRAM (SDRAM), Double Data Rate 2 (DDR/2) SDRAM, extended data out (EDO)/fast page mode (FPM), reduced latency DRAM (RLDRAM), static RAM (SRAM), “flash” memory (e.g., NAND/NOR), memristor memory, and pseudo SRAM (PSRAM).

As used herein, the terms “microprocessor” and “digital processor” are meant generally to include digital processing devices. By way of non-limiting example, digital processing devices may include one or more of digital signal processors (DSPs), reduced instruction set computers (RISC), general-purpose complex instruction set computing (CISC) processors, microprocessors, gate arrays (e.g., field programmable gate arrays (FPGAs)), PLDs, reconfigurable computer fabrics (RCFs), array processors, secure microprocessors, application-specific integrated circuits (ASICs), and/or other digital processing devices. Such digital processors may be contained on a single unitary IC die or distributed across multiple components.

As used herein, the term “network interface” refers to any signal, data, and/or software interface with a component, network, and/or process. By way of non-limiting example, a network interface may include one or more of FireWire (e.g., FW400, FW800, and/or other variations), USB (e.g., USB2), Ethernet (e.g., 10/100, 10/100/1000 (Gigabit Ethernet), 10-Gig-E, and/or other Ethernet implementations), MoCA, Coaxsys (e.g., TVnet™), radio frequency tuner (e.g., in-band or out-of-band, cable modem, and/or other radio frequency tuner protocol interfaces), Wi-Fi (802.11), WiMAX (802.16), personal area network (PAN) (e.g., 802.15), cellular (e.g., 3G, LTE/LTE-A/TD-LTE, GSM, and/or other cellular technology), IrDA families, and/or other network interfaces.

As used herein, the term “Wi-Fi” includes one or more of IEEE-Std. 802.11, variants of IEEE-Std. 802.11, standards related to IEEE-Std. 802.11 (e.g., 802.11 a/b/g/n/s/v), and/or other wireless standards.

As used herein, the term “wireless” means any wireless signal, data, communication, and/or other wireless interface. By way of non-limiting example, a wireless interface may include one or more of Wi-Fi, Bluetooth, 3G (3GPP/3GPP2), High Speed Downlink Packet Access/High Speed Uplink Packet Access (HSDPA/HSUPA), Time Division Multiple Access (TDMA), Code Division Multiple Access (CDMA) (e.g., IS-95A, Wideband CDMA (WCDMA), and/or other wireless technology), Frequency Hopping Spread Spectrum (FHSS), Direct Sequence Spread Spectrum (DSSS), Global System for Mobile communications (GSM), PAN/802.15, WiMAX (802.16), 802.20, narrowband/Frequency Division Multiple Access (FDMA), Orthogonal Frequency Division Multiplex (OFDM), Personal Communication Service (PCS)/Digital Cellular System (DCS), LTE/LTE-Advanced (LTE-A)/Time Division LTE (TD-LTE), analog cellular, cellular Digital Packet Data (CDPD), satellite systems, millimeter wave or microwave systems, acoustic, infrared (i.e., IrDA), and/or other wireless interfaces.

As used herein, the terms “camera,” or variations thereof, and “image capture device,” or variations thereof, may be used to refer to any imaging device or sensor configured to capture, record, and/or convey still and/or video imagery which may be sensitive to visible parts of the electromagnetic spectrum, invisible parts of the electromagnetic spectrum (e.g., infrared, ultraviolet), and/or other energy (e.g., pressure waves).

While certain aspects of the technology are described in terms of a specific sequence of steps of a method, these descriptions are illustrative of the broader methods of the disclosure and may be modified by the particular application. Certain steps may be rendered unnecessary or optional under certain circumstances. Additionally, certain steps or functionality may be added to the disclosed implementations, or the order of performance of two or more steps may be permuted. All such variations are considered to be encompassed within the disclosure.

While the above-detailed description has shown, described, and pointed out novel features of the disclosure as applied to various implementations, it will be understood that various omissions, substitutions, and changes in the form and details of the devices or processes illustrated may be made by those skilled in the art without departing from the disclosure. The foregoing description is in no way meant to be limiting, but rather should be taken as illustrative of the general principles of the technology.

What is claimed is:
1. An image capture device comprising: an image sensor configured to capture images; a touchscreen display configured to present a user interface including a slider interface; and a processing apparatus configured to: receive a field of view selection via the slider interface; oversample, using the image sensor, to obtain an image at a capture resolution that is greater than an encode resolution; determine a crop setting for a RAW domain crop based on the field of view selection; crop the image in a RAW domain using the crop setting to obtain a cropped image; convert the cropped image from the RAW domain to a YUV domain; down-scale the cropped image in the YUV domain to obtain a scaled image at the encode resolution; encode the scaled image at the encode resolution to obtain an encoded image; and store or transmit the encoded image.
2. The image capture device of claim 1, in which the processing apparatus is configured to: down-scale the cropped image in the RAW domain to an intermediate resolution, before conversion to the YUV domain.
3. The image capture device of claim 1, in which the crop setting is a first crop setting, and the processing apparatus is configured to: determine a second crop setting for a YUV domain crop based on the field of view selection and the first crop setting; and crop the image in the YUV domain using the second crop setting, after conversion from the RAW domain.
4. A system comprising: an image sensor configured to capture images; and a processing apparatus configured to: receive a field of view selection; oversample, using the image sensor, to obtain an image at a capture resolution that is greater than an encode resolution; determine a crop setting based on the field of view selection; crop the image using the crop setting to obtain a cropped image; down-scale the cropped image to obtain a scaled image at the encode resolution; encode the scaled image at the encode resolution to obtain an encoded image; and store or transmit the encoded image.
5. The system of claim 4, in which the crop setting is for a RAW domain crop, and the processing apparatus is configured to: crop the image in a RAW domain, before conversion to a YUV domain.
6. The system of claim 4, in which the processing apparatus is configured to: down-scale the image in a RAW domain to an intermediate resolution, before conversion to a YUV domain.
7. The system of claim 4, in which the field of view selection is selected from among three different horizontal field of view settings to dynamically adjust a horizontal field of view of the encoded image.
8. The system of claim 4, in which the processing apparatus is configured to: down-scale the cropped image in a YUV domain, after conversion from a RAW domain.
9. The system of claim 4, comprising a touchscreen display, and in which the processing apparatus is configured to: terminate a low power preview mode responsive to detection of a framing event based on touch data from the touchscreen display, in which images from the image sensor are up-scaled during the low power preview mode.
10. The system of claim 4, comprising a gyroscope, and in which the processing apparatus is configured to: terminate a low power preview mode responsive to detection of a framing event based on angular rate data from the gyroscope, in which images from the image sensor are up-scaled during the low power preview mode.
11. The system of claim 4, comprising a touchscreen display, and in which the processing apparatus is configured to: present a slider interface via the touchscreen display, in which the field of view selection is received via the touchscreen display based on a user interaction with the slider interface.
12. The system of claim 4, in which the processing apparatus comprises an image signal processor that is configured to: crop the image using the crop setting to obtain a cropped image; and down-scale the cropped image to obtain a scaled image at the encode resolution.
13. A method comprising: receiving a field of view selection; oversampling, using an image sensor, to obtain an image at a capture resolution that is greater than an encode resolution; determining a crop setting based on the field of view selection; cropping the image using the crop setting to obtain a cropped image; down-scaling the cropped image to obtain a scaled image at the encode resolution; encoding the scaled image at the encode resolution; and storing, displaying, or transmitting an output image based on the scaled image.
14. The method of claim 13, in which the crop setting is for a RAW domain crop, and cropping the image using the crop setting to obtain a cropped image comprises: cropping the image in a RAW domain, before conversion to a YUV domain.
15. The method of claim 13, comprising: down-scaling the image in a RAW domain to an intermediate resolution, before conversion to a YUV domain.
16. The method of claim 13, in which the field of view selection is selected from among three different horizontal field of view settings to dynamically adjust a horizontal field of view of the output image.
17. The method of claim 13, in which down-scaling the cropped image to obtain a scaled image at the encode resolution comprises: down-scaling the cropped image in a YUV domain, after conversion from a RAW domain.
18. The method of claim 13, comprising: terminating a low power preview mode responsive to detection of a framing event based on touch data from a touchscreen display, in which images from the image sensor are up-scaled during the low power preview mode.
19. The method of claim 13, comprising: terminating a low power preview mode responsive to detection of a framing event based on angular rate data from a gyroscope, in which images from the image sensor are up-scaled during the low power preview mode.
20. The method of claim 13, comprising: presenting a slider interface via a touchscreen display, in which the field of view selection is received via the touchscreen display based on a user interaction with the slider interface.